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f(z) + f22) + f8x) =0 


for all real x. 


This functional equation is satisfied by the function f(x) = 0, and also by 
the strange example graphed above. To find out more about this function, see 
Chapter 3. 
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Can you discover a function f(x) which satisfies this functional equation? 
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Preface 


Over the years, a number of books have been written on the theory of func- 
tional equations. However, few books have been published on solving func- 
tional equations which arise in mathematics competitions and mathematical 
problem solving. The intention of this book is to go some distance towards 
filling this gap. 

This work began life some years ago as a set of training notes for 
mathematics competitions such as the William Lowell Putnam Competition 
for undergraduate university students, and the International Mathematical 
Olympiad for high school students. As part of the training for these competi- 
tions, I tried to put together some systematic material on functional equations, 
which have formed a part of the International Mathematical Olympiad and a 
small component of the Putnam Competition. As I became more involved 
in coaching students for the Putnam and the International Mathematical 
Olympiad, I started to understand why there is not much training mate- 
rial available in systematic form. Many would argue that there is no theory 
attached to functional equations that are encountered in mathematics compe- 
titions. Each such equation requires different techniques to solve it. Functional 
equations are often the most difficult problems to be found on mathematics 
competitions because they require a minimal amount of background theory 
and a maximal amount of ingenuity. The great advantage of a problem involv- 
ing functional equations is that you can construct problems that students at 
all levels can understand and play with. The great disadvantage is that, for 
many problems, few students can make much progress in finding solutions even 
if the required techniques are essentially elementary in nature. It is perhaps 
this view of functional equations which explains why most problem-solving 
texts have little systematic material on the subject. Problem books in mathe- 
matics usually include some functional equations in their chapters on algebra. 
But by including functional equations among the problems on polynomials or 
inequalities the essential character of the methodology is often lost. 

As my training notes grew, so grew my conviction that we often do not do 
full justice to the role of theory in the solution of functional equations. The 
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result of my growing awareness of the interplay between theory and problem 
application is the book you have before you. It is based upon my belief that 
a firm understanding of the theory is useful in practical problem solving with 
such equations. At times in this book, the marriage of theory and practice is 
not seamless as there are theoretical ideas whose practical utility is limited. 
However, they are an essential part of the subject that could not be omit- 
ted. Moreover, today’s theoretical idea may be the inspiration for tomorrow’s 
competition problem as the best problems often arise from pure research. We 
shall have to wait and see. 

The student who encounters a functional equation on a mathematics con- 
test will need to investigate solutions to the equation by finding all solutions 
(if any) or by showing that all solutions have a particular property. Our em- 
phasis is on the development of those tools which are most useful in giving a 
family of solutions to each functional equation in explicit form. 

At the end of each chapter, readers will find a list of problems associated 
with the material in that chapter. The problems vary greatly in difficulty, 
with the easiest problems being accessible to any high school student who has 
read the chapter carefully. It is my hope that the most difficult problems are 
a reasonable challenge to advanced students studying for the International 
Mathematical Olympiad at the high school level or the William Lowell Put- 
nam Competition for university undergraduates. I have placed stars next to 
those problems which I consider to be the harder ones. However, I recognise 
that determining the level of difficulty of a problem is somewhat subjective. 
What one person finds difficult, another may find easy. 

In writing these training notes, I have had to make a choice as to the gen- 
erality of the topics covered. The modern theory of functional equations can 
occur in a very abstract setting that is quite inappropriate for the readership 
I have in mind. However, the abstraction of some parts of the modern theory 
reflects the fact that functional equations can occur in diverse settings: func- 
tions on the natural numbers, the integers, the reals, or the complex numbers 
can all be studied within the subject area of functional equations. Most of the 
time, the functions I have in mind are real-valued functions of a single real 
variable. However, I have tried not to be too restrictive in this. The reader will 
also find functions with complex arguments and functions defined on natural 
numbers in these pages. In some cases, equations for functions between circles 
will also crop up. Nor are functional inequalities ignored. 

One word of warning is in order. You cannot study functional equations 
without making some use of the properties of limits and continuous functions. 
The fact is that many problems involving functional equations depend upon 
an assumption of such as continuity or some other regularity assumption that 
would usually not be encountered until university. This presents a difficulty 
for high school mathematics contests where the properties of limits and conti- 
nuity cannot be assumed. One way to get around this problem is to substitute 
another regularity condition that is more acceptable for high school mathe- 
matics. Thus a problem where a continuity condition is natural may well get 
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by with the assumption of monotonicity. Although continuity and monotonic- 
ity are logically independent properties (in the sense that neither implies the 
other) the imposition of a monotonicity condition in a functional equations 
problem may serve the same purpose as continuity. Another way around the 
problem is to ask students to provide a weaker conclusion that is not “finished” 
by invoking continuity. Asking students to determine the nature of a function 
on the rational numbers is an example of this. Neither solution to this problem 
is completely satisfactory. Fortunately, there are enough problems which can 
be posed and solved using high school mathematics to serve the purpose. More 
advanced contests such as the William Lowell Putnam Competition have no 
such restrictions in imposing continuity or convexity, and expect the student 
to treat these assumptions with mathematical maturity. 

Some readers may be surprised to find that the chapter on functional 
equations in a single variable follows that on functional equations in two or 
more variables. However this is the correct order. An equation in two or more 
variables is formally equivalent to a family of simultaneous equations in one 
variable. So equations in two variables give you more to play with. I have 
had to be very selective in choosing topics in the third chapter, because much 
of the academic literature is devoted to establishing uniqueness theorems for 
solutions within particular families of functions: functions that are convex or 
real analytic, functions which obey certain order conditions, and so on. It 
would be easy to simply ignore the entire subject if it were not for the fact 
that functional equations in a single variable are commonplace in mathematics 
competitions. So I have done my best to present those tools and unifying 
concepts which occur periodically in such problems in both high school and 
university competitions. Chapter 3 has been written with a confidence that 
advanced high school students will adapt well to the challenge of a bit of 
introductory university level mathematics. The chapters can be read more-or- 
less independently of each other. There are some results in later chapters which 
depend upon earlier chapters. However, the reader who wishes to sample the 
book in random order can probably piece together the necessary information. 
The fact that it is possible to write a book whose chapters are not heavily 
dependent is a consequence of the character of functional equations. Unlike 
some branches of mathematics, the subject is wide, providing easier access 
from a number of perspectives. This makes it an excellent area for competition 
problems. Even tough functional equations are relatively easy to state and 
provide lots of “play value” for students who may not be able to solve them 
completely. 

Because this is a book about problem solving, the reader may be surprised 
to find that it begins with a chapter of the history of the subject. It is my 
belief that the present way of teaching mathematics to students puts much 
emphasis on the tools of mathematics, and not enough on the intellectual 
climate which gave rise to these ideas. Functional equations were posed and 
solved for reasons that had much to do with the intellectual challenges of 
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their times. This book attempts to provide a small glimpse of some of those 
reasons. 

I have learned much about functional equations from other people. This 
book also owes much to others. So this preface would not be complete with- 
out some mention of the debts that I owe. I have learned much from the 
work of Janos Aczél, Distinguished Professor Emeritus at the University of 
Waterloo. The impact of his work and that of his colleagues is to be found 
throughout the following pages in places too numerous to mention. The initial 
stages of this monograph were written at the instigation of Pat Stewart and 
Richard Nowakowski. Sadly, Pat Stewart is no longer with us, and is missed 
by the mathematical community. Thank you, Patrick and Richard. Finally, I 
would like to thank Professor Ed Barbeau, who generously sent some of his 
correspondence problems to me. His encouragement and assistance are much 
appreciated. 


1 


An historical introduction 


1.1 Preliminary remarks 


In high school algebra, we learn about algebraic equations involving one or 
more unknown real numbers. Functional equations are much like algebraic 
equations, except that the unknown quantities are functions rather than real 
numbers. This book is about functional equations: their role in contempo- 
rary mathematics as well as the body of techniques that is available for their 
solution. Functional equations appear quite regularly on mathematics com- 
petitions. So this book is intended as a toolkit of methods for students who 
wish to tackle competition problems involving functional equations at the high 
school or university level. 

In this chapter, we take a rather broad look at functional equations. Rather 
than focusing on the solutions to such equations—a topic for later chapters— 
we show how functional equations arise in mathematical investigations. Our 
entry into the subject is primarily, but not solely, historical. 


1.2 Nicole Oresme 


Mathematicians have been working with functional equations for a much 
longer period of time than the formal discipline has existed. Examples of early 
functional equations can be traced back as far as the work of the fourteenth 
century mathematician Nicole Oresme who provided an indirect definition 
of linear functions by means of a functional equation. Of Norman heritage, 
Oresme was born in 1323 and died in 1382. To put these dates in perspective, 
we should note that the dreaded Black Death, which swept through Europe 
killing possibly as much as a third of the population, occurred around the 
middle of the fourteenth century. Although the origins of the Black Death 
are unclear, we know that by December of 1347, it had reached the western 
Mediterranean through the ports of Sicily, then Sardinia, then the port city 
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of Marseilles. It reached Paris in the spring of 1348, having spread throughout 
much of the country. 

The year 1348 is also of some significance in mathematics, because that 
is the year that Nicole Oresme is recorded in a list of scholarship holders 
at the University of Paris. Thus it appears that Oresme was studying at 
the University of Paris from some time in the early 1340s up to the time 
the Black Death arrived in the city itself. Today, perhaps, we might well 
wonder how, in the face of this calamitous disease, scholars such as Oresme 
were able to flourish. However, the Black Death was more disruptive for the 
generation that followed Oresme. He and his colleagues had completed much 
of their education by the time that the Black Death’s devastating effects were 
felt. By 1355, Oresme had obtained the Master of Theology degree, and was 
soon thereafter appointed Grand Master at the College of Navarre, one of 
the colleges of the University of Paris, founded in 1304. Nicole Oresme was 
arguably the greatest European mathematician of the fourteenth century. He 
died in 1382 in Lisieux. 

Although he lived in a medieval world in which the writings of Aristotle 
were the dominant influence on natural philosophy—what we would now call 
the natural sciences—Oresme’s scholarly work foreshadowed the work of later 
writers in the Renaissance and Enlightenment periods, who broke away from 
Aristotle to reformulate the laws of mechanics and, by so doing, create the field 
of classical physics. In 13852, Oresme wrote a major treatise on uniformity and 
difformity of intensities, entitled Tractatus de configurationibus qualitatum 
et motuum. In this important work, Oresme established the definition of a 
functional relationship between two variables, and the idea (well ahead of 
Réné Descartes) that one can express this relationship geometrically by what 
we would now call a graph.! In Part 1 he wrote 


Therefore, every intensity which can be acquired successively ought 
to be imagined by a straight line perpendicularly erected on some 
point of the space or subject of the intensible thing, e.g., a quality. 
For whatever ratio is found to exist between intensity and intensity, in 
relating intensities of the same kind, a similar ratio is found to exist 
between line and line, and vice versa.” 


' Part of Oresme’s efforts were dedicated to proving the so-called mean speed theo- 
rem, also known as the Merton theorem because of its associations with the work 
at Merton College, Oxford. William of Heytesbury, a Fellow of Merton College 
summarised this result in his treatise Rules for Solving Sophisms in 1335 by say- 
ing “the moving body, acquiring or losing . . . [velocity] uniformly during some 
period of time, will traverse distance exactly equal to what it would traverse in an 
equal period of time if it were moved uniformly at its mean degree [of velocity].” 
An important consequence of this theorem is that a body undergoing a constant 
acceleration (such as a freely falling body) will traverse a distance which is a 
quadratic function of time. 

? This is the 1968 translation of Marshall Clagett. See Oresme [1968]. 
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Of central interest in his treatise is the idea of uniform motion and “uniformly 
difform motion,” the latter denoting the motion of a particle undergoing uni- 
form acceleration.? Also considered was “difformly difform motion,” where 
the acceleration itself varied. In the section on quadrangular quality, Oresme 
took care to define his notion of uniform difformity (i.e., linearity) as follows. 


A uniform quality is one which is equally intense in all parts of the 
subject, while a quality uniformly difform is one in which if any three 
points [of the subject line] are taken, the ratio of the distance between 
the first and the second to the distance between the second and the 
third is as the ratio of the excess in intensity of the first point over 
that of the second point to the excess of that of the second point over 
that of the third point, calling the first of those three points the one 
of greatest intensity. 


As Aczél [1984] and Aczél and Dhombres [1985] have noted, the passage de- 
fines a linear function (i.e., a quality which is uniformly difform) through a 
functional equation. In modern terminology, we would have three distinct+ 
real numbers x, y, and z, say, which are described in the passage above as 
three points of the subject line. Associated with x, y and z, we have a variable 
(i.e., the “intensity” of the quality at each point of the subject line) which we 
can write as f(x), f(y), and f(z), respectively. The function f is defined to 
be linear, or “uniformly difform” if 
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= = AAS Fa) for all distinct values of x, y, z. (1.1) 
What makes Oresme’s definition a functional equation is that f is treated 
abstractly: one may plug any function into this equation to see whether the 
equation is satisfied for all possible x, y and z. We can compare this with the 
standard definition to be found in most introductory modern textbooks which 
say that a linear function is one of the form 


f(z) =axr+b for some a, b. (1.2) 


3 Although in modern mathematical terms we would probably prefer to translate 

the word “uniform” as “constant” here. Thus we could say in the case of uniform 
motion that a particle which undergoes motion with a constant velocity changes 
its position uniformly, i.e., linearly, as a function of time. The preferred choice of 
terminology can be left to the taste of the reader. 
By “distinct” here, I mean that no pair of the three numbers can be equal. For 
the purposes of the definition it is sufficient to require that y # z. However, 
the geometric language in which Oresme frames his definition clearly points to 
the interpretation given. Note also that we need to take a small liberty with the 
text and interpret the word “distance” as “signed distance” in the modern sense. 
Trying to resolve this ambiguity by ordering the points and requiring the function 
to be increasing is too artificial. 
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Fig. 1.1. One of Oresme’s “uniformly difform” (i.e., linear) functions, defined by 
a functional equation. Redrawn and adapted from a manuscript diagram in his 
Tractatus de configurationibus qualitatum et motuum. An illustration for the Mean 
Speed Theorem. 


Oresme’s equation in (1.1) is a functional equation. The definition in (1.2) with 
a # (is its solution. Note that Oresme’s definition does not allow the constant 
linear function where a = 0. This is faithful to his intentions here, which 
distinguish uniformly difform functions from uniform functions according to 
the choices a £ 0 and a = 0, respectively. 


1.3 Gregory of Saint-Vincent 


Over the next few hundred years, functional equations were used but no gen- 
eral theory of such equations arose. Notable among such mathematicians was 
Gregory of Saint-Vincent (1584-1667), whose work on the hyperbola made 
implicit use of the functional equation f(xy) = f(x) + f(y), and pioneered 
the theory of the logarithm. 

Saint-Vincent’s result appeared in his great 1647 treatise entitled Opus 
Geometricum quadraturae circuli et sectionum coni. If the title of this work 
appears long, the treatise itself, at about 1250 pages, was much longer! It deals 
with methods for calculating areas and with the properties of conic sections. 
In particular, Saint-Vincent shows how it is possible to calculate the area 
under an hyperbola such as y = x! as in Figure 1.3. In modern times, the 
area under a curve such as an hyperbola is a topic usually left for the theory 
of integration. However, Saint-Vincent made great progress on the problem 
using purely geometric arguments. 

In particular, Saint-Vincent’s argument was based upon the following ge- 
ometrical principle. 


1.3 Gregory of Saint-Vincent i) 


fi 


Fig. 1.2. Simultaneous stretching and shrinking of a planar region in perpendicu- 
lar directions. If a region is simultaneously stretched and shrunk in perpendicular 
directions by the same factor, the area will be unchanged. 


N 


Tie) =f) + fo) 


area = fix) 


Fig. 1.3. Gregory of Saint- Vincent pioneered the theory of logarithms by recognising 
that the area under an hyperbola satisfies a functional equation. Using the geometric 
argument for constant area shown above, Gregory of Saint- Vincent was able to derive 
the functional equation now associated with the logarithm. 
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If a planar region is stretched horizontally by a given factor, and si- 
multaneously shrunk vertically by the same factor, then the resulting 
region will have an area which is equal to that of the original region. 


For example, in Figure 1.2, we see that a planar region has been scaled ver- 
tically and horizontally by stretching with a factor of 2 horizontally, and 
shrinking by a factor of 2 vertically. The second region has the same area as 
the first. 

Now let us see how this geometrical principle applies to the area under the 
hyperbola y = a~+. Let f(a) denote the shaded area on the interval from 1 
to x shown in Figure 1.3, and consider the corresponding shaded area under 
the same hyperbola erected on the interval from y to xy for any y > 1 say. 
Comparing the two shaded regions, Saint-Vincent noted that they differ by 
a scale factor of y along the x-axis, and by a scale factor of y~! along the 
y-axis. Thus the areas of the two regions must be the same. 

The area of the shaded region with base from y to xy is f(x y) — f(y). This 
follows immediately from the fact that the region under the hyperbola from 
y to xy is exactly that which is obtained by removing the region from 1 to y 
from the region from 1 to xy. Thus, using Saint-Vincent’s scaling argument, 
we have 


f(@) = f(xy) — FY) 
or equivalently that 
f(xy) = fle) + Fly). 


We now recognise this equation as the distinctive functional equation 
for the family of logarithms. However, the theoretical work which links this 
functional equation to the family of logarithms had to wait for the work of 
Augustin-Louis Cauchy. 

Along with his work on conics and the calculation of area, Saint-Vincent 
is also remembered for the contributions from the second part of his treatise 
Opus geometricum, where he studied infinite series. With his work on areas, 
the method of exhaustion, and series, Gregory of Saint- Vincent was one of the 
early pioneers of the modern methods of calculus and analysis. 


1.4 Augustin-Louis Cauchy 


Although Nicole Oresme’s definition of linearity can be interpreted as an early 
example of a functional equation, it does not represent a starting point for 
the theory of functional equations. The subject of functional equations is 
more properly dated from the work of A. L. Cauchy. Born in 1789 in Paris, 
France, Cauchy’s early years coincided with the French Revolution. To put his 
birthdate in context, we should recall that the French Revolution is generally 
dated to the ten-year period 1789-1799, starting roughly with the storming of 
the Bastille in 1789. In 1799, when the young Cauchy was ten years old, general 
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Napoleon Bonaparte led a coup against the Directoire to begin a period of 
direct military rule of France. As Cauchy’s family had royalist sympathies, 
they left Paris and did not return until 1800. A strong monarchist himself, 
Augustin-Louis Cauchy later ran counter to the republican and Napoleonic 
trends in France during his early years. A brilliant mathematician, Cauchy 
worked in many areas of mathematics. However, he is primarily known for 
his work on calculus, and is recognised as one of the founders of the modern 
theory of mathematical analysis. 
The functional equation that is particularly associated with Cauchy is 


flet+y) = f(a) + fy) (1.3) 


for all real x and y, and is now called Cauchy’s equation. It is required to 
find all real-valued functions f satisfying equation (1.3). Now the reader can 
immediately notice that Cauchy’s equation is satisfied by any function of the 
form 
f(a) = ax, 

the constant a being an arbitrary real number. However, our ability to find a 
simple solution to this equation is only a small part of the story. We must also 
ask whether the family of functions of the form f(x) = ax is the complete 
set of all solutions to equation (1.3). It seems reasonable that such linear 
functions are the only solutions to (1.3). However, this turns out to be true 
only if some mild restriction is imposed upon the function f. For example, 
functions of the form f(a) = az are the only solutions to (1.3) among the 
class of functions which are bounded on some interval of the form (—c, c), 
where c > 0. Alternatively, it can be shown that f(x) = ax form the only 
class of solutions among the continuous real-valued functions on the real line. 
We investigate this equation and its solutions in detail in Chapter 2. 

What was the motivation for Cauchy’s investigation of the full set of so- 
lutions to (1.3)? To understand this, we must examine Cauchy’s rigorous 
derivation of the general statement of the binomial theorem. For centuries, 
mathematicians have known the formula 
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for nonnegative integer values of n. The binomial coefficients of this sum, 


namely (") (ei 2) ED 


a i! 


(1.5) 


are the well-known entries in Pascal’s triangle. 

In fact, the binomial coefficients defined in (1.5) are meaningfully defined 
by the formula when n is replaced with any real number 2z, as long as i remains 
a nonnegative integer. It was Isaac Newton who, in 1676, demonstrated how 
to extend the binomial formula in (1.4) in order to expand (1+ 2)* in powers 
of « when z is an arbitrary real number. Newton’s formula 
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(te) =1tz0+ (G)o74() 84 (1.6) 
is an infinite sum, which reduces to the finite sum in (1.4) because terms after 
the (z+ 1)st vanish when z is a nonnegative integer. Unfortunately, Newton’s 
proof of (1.6) was not rigorous. So it was left to subsequent mathematicians 
to fill in the entire argument. Cauchy began by considering the right hand 
side of equation (1.6). The first problem that arises is whether this expression 
has any meaning at all. For example, if z = —1 and x = 1 we get 


which does not converge to a finite real number as more and more terms are 
summed, despite the fact that the left hand side of (1.6) equals 1/2. Cauchy 
rigorously demonstrated that when || < 1 the right hand side does converge 
to a (finite) real number for all real values of z. So he defined a function 


f(z) = tect (3)ar+(S)a%+... (1.7) 


and showed by combinatorial arguments and rules for multiplying two such 
infinite sums that 

flz+w) = f(z) fw) (1.8) 
for all real z and w. This functional equation is called Cauchy’s exponential 
equation, and we meet it again in the next chapter. It is tempting simply to 
take logarithms in order to turn it into (1.3). However, we have to be careful 
to check that f(z) > 0 before we can say that log f(z) is meaningful. As we 
show, a solution to Cauchy’s exponential equation is either everywhere zero 
or is everywhere strictly positive. As the latter is the case here, we can take 
logarithms to obtain the equation 


g(z + w) = g(z) + g(w) 


where g = log f. In order to show that g(z) = az for some z, Cauchy had to 
demonstrate that f, and therefore g, are continuous functions. This turned 
out to be harder than he expected. Having verified that g(z) = az for some 
a, he was able to conclude that f(z) = b* for some value of b. In particular, 
f(1) = 0. It remained for Cauchy to observe that b = 1+ 2, a fact that is 
immediately deduced by setting z = 1 in (1.7). 


1.5 What about calculus? 


Readers who know some calculus may wonder why Cauchy’s equation f(x + 
y) = f(x) + f(y) cannot be solved by differentiating. 

Substituting y = c, a constant, and differentiating with respect to x, we 
get 
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f'(a+o) = f'(z) 
for all real c. This implies that f’ is a constant function, and therefore that f is 
a linear function of the form f(x) = ax + b. Substituting back into Cauchy’s 
equation, we determine that 6 = 0. 

This argument is fine, as far as it goes. However, it makes a very big 
assumption, namely that f can be differentiated. Although many of us are 
used to making this assumption as a matter of course, we would prefer to 
prove results without any assumptions other than those which are strictly 
necessary. Functions which do not have derivatives at certain points are quite 
common. For example, the function f(x) = |x| has a derivative everywhere 
except at x = 0. In higher mathematics, it is not uncommon to consider 
functions which have no derivative at any value of x. A good example of such 
a function is 


f(x) = 


This function is not continuous for any value of x. So it has no derivative for 
any value of x. It is even possible to construct continuous functions which do 
not have a derivative at any value x. 

The point is that we do not wish to rule out functions from consideration by 
automatically assuming that we can differentiate f(a). We will have occasion 
to assume that derivatives exist in subsequent functional equations. However, 
such assumptions should always be made sparingly, with an eye to removing 
them if they are unnecessary for the proof of the result. 


1 if x is rational 
0 if xz is irrational. 


1.6 Jean d’Alembert 


Historically, Jean d’Alembert precedes Augustin-Louis Cauchy. However, in 
the context of functional equations, it seems more natural to consider his 
contributions after Cauchy. 

Jean d’Alembert was a man of many names. The illegitimate son of an 
army officer, Louis-Camus Destouches, and a writer, Claudine Guérin de 
Tencin, he was born in Paris in 1717, while his father was abroad. Shortly 
after his birth, his mother abandoned him at the church of Saint-Jean-le- 
Rond. Following tradition, he was named Jean le Rond after the church, and 
placed in an orphanage. Upon the return of his father, he was removed from 
the orphanage, and placed with Mme. Rousseau, the wife of a glazier. Al- 
though Destouches continued to support his son financially, he chose not to 
publicly acknowledge his son. In 1738, Jean le Rond entered law school, where 
he was registered under the name Daremberg. He later changed this name to 
d’Alembert. 

In their efforts to understand the principles of combinations of forces, 
mathematicians of the eighteenth century, such as d’Alembert, were led to 
the equation 
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g(x +y) + g(@— y) =29(x) gly), (1.9) 


where 0 < y < x < 1/2. Equation (1.9) is now known as d’Alembert’s equation. 
It is required to find all real-valued functions g satisfying equation (1.9). 
Here, we encounter a greater level of difficulty in the analysis compared 
to Cauchy’s equation. The equation is reminiscent of a trigonometric identity. 
Trying out our elementary trigonometric functions, we observe that g(x) = 
cos x works, but g(a) = sin x does not. Are there any other solutions? Playing 
with our solution a bit, we are tempted to look for solutions of the form 


g(x) = b cos ax 


for suitably chosen constants a, b. However, letting x = y = 0 in equation (1.9) 
reduces it to the equation g(0) = g(0)?, telling us that g(0) = 0 or g(0) = 1. 
These correspond to the cases b = 0 and b = 1 respectively. The constant a 
turns out to be arbitrary: if g(x) is any solution to equation (1.9) then g(az) 
is also a solution. 

This tells us that we can expand our single solution to include 


Are there any other solutions to d’Alembert’s equation? It turns out that 
there are. Once again, it was Cauchy [1821] who managed to show that a full 
set of continuous solutions to (1.9) is given by 


0 
g(x) = ¢ cos ax (1.10) 
(b? + b-*)/2, b>O0. 


The seeming incongruity between the second and third lines in (1.10) can 
be explained by the fact that (b” + b~”)/2 can be written in terms of the 
hyperbolic cosine of x. The hyperbolic cosine shares many standard algebraic 
identities in common with the usual (circular) cosine, and this explains its 
ability to satisfy d’Alembert’s equation as well. Indeed, if we choose b to be 
the famous constant e + 2.71828, then the resulting formula is the definition 
of the hyperbolic cosine. 


1.7 Charles Babbage 


One property that both Cauchy’s and d’Alembert’s equations have in common 
is that, although they involve functions of a single variable, the equations are 
formulated using two variables, namely x and y. A rather different class of 
functional equations was investigated by the British mathematician Charles 
Babbage. 
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Fig. 1.4. Charles Babbage (1791-1871). 


Charles Babbage is best remembered as a founder of modern comput- 
ing. Although his difference engine and his analytical engine—the latter pro- 
grammable by punched cards—were not built during his lifetime, some of 
his other inventions were. Among his other inventions was the cowcatcher, 
that remarkable device that was attached to the front of trains to remove 
obstacles—such as cows—that might cause the train to be derailed. Today 
this device is most commonly seen in movies of the old west of the old Hol- 
lywood tradition where the steam locomotive, with cowcatcher, has achieved 
iconic status. His other inventions include skeleton keys. His mathematical 
studies of the postal system were also to lead to the idea of the flat rate 
postage stamp. 

However, it is Charles Babbage the mathematician that we shall be con- 
cerned with here. In 1812, together with Robert Woodhouse, John Herschel, 
the son of the astronomer William Herschel, and George Peacock, he founded 
the Cambridge Analytical Society. It was the intention of the Analytical So- 
ciety to promote the use of Leibniz’ methodology for calculus over the New- 
tonian version which was entrenched in British mathematics at the time. In 
1819, the Society was renamed the Cambridge Philosophical Society. It is by 
this name that it is known today. 
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On June 15, in 1815, Charles Babbage read a paper before the Royal Soci- 
ety of London that made his reputation as a mathematician of high originality 
and ability.® 

Babbage opened this paper by noting that many mathematical calculations 
consist of two parts: the direct calculation and its inverse. For example, raising 
a number to a power is a direct calculation, whereas the extraction of a root 
is its inverse operation. In most cases, the inverse operation is of much greater 
difficulty than the direct operation. Babbage remarked that 


It is this inverse method with respect to functions, which I at present 
propose to consider. 


More explicitly, we may say that if a function is given, then we may determine 
some particular equation that the function satisfies; this would be a direct 
calculation applied to the function. The inverse problem would be to start 
with a given equation and to determine the family of functions satisfying that 
equation. 

From this broad outline, Babbage’s argument proceeds by way of a se- 
quence of examples of functional equations and their solutions, of increasing 
generality and complexity. The first part of the paper considers equations of 
the form 


Fle, f(x), f(ar(a)), f(a2(#)), ---, flan(2))] =0, (1.11) 


where functions F' and aj,...Q@, are given. It is required to find all functions 
f satisfying the equation for each particular choice of F and a. For example, 
the first problem treated in the paper is to determine all functions f satisfying 
the functional equation 

f(x) = fla(z)] (1.12) 
for given function a(a). Upon inspection it is easily seen that for many func- 
tions a there are infinitely many functions f satisfying this equation, and that 
these functions are quite diverse in nature. If a particular solution fo has been 
found, then all functions of the form 


f(x) = o[fo(2)], (1.13) 


where o is arbitrary, will all satisfy the functional equation, although the 
family of such functions may not be the complete collection of all solutions. 


> An essay towards a calculus of functions, Philosophical Transactions of the Royal 
Society of London 105, 389-423. This paper was the first of several on the subject. 
It was followed by An essay towards the calculus of functions, part II, Phil. Trans. 
106, 179-256, which appeared in 1816. In 1817, he published two papers entitled 
Observations on the analogy which subsists between the calculus of functions and 
the other branches of analysis, Phil. Trans. 107, 197-216, and Solutions of some 
problems by means of the calculus of functions, Journal of Science, 2, 371-79. 
In 1822, there followed the paper Observations on the notation employed in the 
calculus of functions, Trans. Camb. Phil. Soc. 1, 63-76. 
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Example 1.1. Of particular interest in equation (1.12) are special cases such 
as 


fla) = 4 ( M (1.14) 


x—-1 


where a(z) = x/(a — 1). In this case, the function a(x) is an involution, in 
the sense that if it is applied twice to any value x 4 1, we get the identity: 


pat ae 
a, x a, x—-1 


ut? > =. 
x—1 (=) 54 
x—1 
Better known examples of involutions include 2 ++ —x and ++ x~!. A general 
family of solutions to (1.12) when a(x) is an involution is 


f(x) =7 [a, a(x)] , (1.15) 


where t(u,v) is an arbitrary symmetric function of wu and v. (That is, 
T(u,v) = T(v,u).) An interesting function which is a solution to (1.14) is 
f(a) = coslog(a — 1). The reader can consider how to express this particular 
solution in the form given in (1.15). 


An extension of this problem is to find solutions to two or more simulta- 
neous functional equations, such as 


f(x) = fla(z)], —— F(@) = FIP) (1.16) 
for given functions a(x) and G(x). 


Example 1.2. The pair of simultaneous equations 


where, once again, o is arbitrary. 


In the second part of the paper, Babbage studied families of equations 
which, up to that point, had not received any systematic treatment in the 
literature. These n“” order functional equations, as he called them, are of the 
form 


Bae Fh (2p cans Fe) 0s (1.17) 


where, once again, F’ is given and f is to be determined. Here 
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and so on. In this case, the function F is known, and one seeks to find all 
functions f such that equation (1.17) holds for all real 2. For example, a 
solution f(a) to the equation 


fr(a) =a 


would be an iterative n“” root of the identity function x +> x. In particular, 
we can recognise that an iterative square root of the identity function is an 
involution as discussed above. Babbage noted that if r(u,v) is any arbitrary 
antisymmetric function of u and v,° then any involution f(x) must satisfy 


tla, f(x)] =0. (1.18) 


Such an equation can be used to find involutions. 


2 


Example 1.3. Setting T(u,v) = u? — v? gives us two involutions, namely 


f(x) =a (the trivial involution) and f(a) =-«2. 


A less obvious example is T(u, v) = (u? + v) — (v? + u), from which we obtain 
the equation 

(a? — x) + f(z) — [f(x)]? = 0 
This is a quadratic in f(x) that is solved by 


1+ /1-42(1-2) 1—,/1-42 (1-2) 
fg) == and f(z) = 


both of which are easily checked to be involutions using the identity 


f(x) [L- f(@)] = 2-2). 
In a similar manner, many other solutions to f?(a) = x can be obtained. 


Using such solutions, Babbage noticed a method for generating many more. 
If f(x) is an involution, then so is 


g(x) = o*{ f[o(a)] } (1.19) 


for any one-to-one function ¢. This is because 


Problem 12 of Babbage’s paper asks for solutions to the equation 


® That is, 7(u,v) = —T(v, u). 
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Pa) = a(z) (1.20) 


for any given function a(x). Suppose we search for solutions of the form 


f(x) =o "{ Bld(2)]}, 


where @ is some given function—but not an involution unless a(#) = «—and 
@(z) is a one-to-one function to be determined. Then 


a(x) = f(2) 
=o { P[6(@)]}, 


which implies that ¢(x) must be a solution to the functional equation 


gla(x)] = 6?[d(2)]. (1.21) 


This is a type of functional equation called a conjugacy equation. We discuss 
equations of this sort in Chapter 3. Although the reduction of (1.20) to a 
conjugacy equation does not amount to a solution, it is easier in some cases 
to solve an equation such as (1.21) than equation (1.20). The hope is that 
we can reduce a given equation involving an iterative root to a conjugacy 
equation of a particularly simple sort that we can solve. This is one of the 
topics that we discuss in Chapter 3, where we develop some methods to solve 
special conjugacy equations due to Abel and Schréder. 


Example 1.4. It is easy to check that a solution to 
Pasa, -a 40 (1.22) 


is given by 
f(a) = —aFen) , (1.23) 


where sgn(z) denotes the signum function of « whose value is +1 when x is 
positive and —1 when zx is negative. 


Example 1.5. Let us find a solution to 


ayy. 
f(z) = xrt+1- 
Following Babbage’s method, we choose 
Ala) =2+ 5 


Thus 6?(2) = 2+ 1. We search for a function ¢(x) such that 


f(x) = & *{6[9(x)] } 


will work. So ¢(a) must be a solution to the conjugacy equation 
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a( : ) = ole) +1. 


at+1 


This is also a bit hard to solve. However, we can write it as 


1 1 
(=) 1+ 4(z) a 


which is immediately seen by inspection to be solved by ¢(x) = x~!. Therefore 
our solution has the form 


II 
Gon ON 
8 

ly 

+ 
Nl rR 
es 
U 


Q+2 


It is easily checked that this is a solution. 


1.8 Mathematics competitions and recreational 
mathematics 


The subject of functional equations has also found a niche in recreational 
mathematics and in mathematics competitions. 

The idea of throwing out mathematical challenges and puzzles for other 
enthusiasts to solve is very old. For example, during the Italian Renaissance, it 
was commonplace for mathematicians to issue public challenges to others for 
the solutions to certain problems. Nowadays, a similar function is provided by 
numerous journals dedicated—in whole or in part—to problem solving. Many 
of these journals encourage the publication of problems which require only 
high school mathematics, or undergraduate mathematics at most. For exam- 
ple, in the United States and Canada, journals such as Mathematics Mag- 
azine, published by the Mathematical Association of America, devote space 
to mathematics problems. The journal, Crux Mathematicorum, published by 
the Canadian Mathematical Society, is dedicated entirely to publishing origi- 
nal problems. Its full title, Cruz Mathematicorum with Mathematical Mayhem 
reflects the fact that it is a union of two previous journals, Cruz Mathemati- 
corum and Mathematical Mayhem. As an independent journal, Mathematical 
Mayhem was written by, and for, students. 

Among problems appearing in such journals, functional equations often 
appear because their solutions often require only elementary manipulations 
without advanced theory. Another forum in which high school or undergradu- 
ate students can participate is in mathematics competitions. Many countries 
offer a mathematical olympiad system for high school students with a system 
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of competitions that culminates in a national olympiad. Beyond that, the In- 
ternational Mathematical Olympiad (IMO) involves more than 80 countries, 
each country being represented by an elite group of six students. Functional 
equations often appear among the questions posed at the IMO, and are reg- 
ularly shortlisted for consideration each year. 

At the university level, the choices for students are fewer. However, in 
North America, the William Lowell Putnam Competition is held regularly on 
the first Saturday in December at more than 300 universities and colleges in 
the United States and Canada. Problems involving functional equations made 
an early appearance in the history of this competition. 

In early competition problems, the study of functional equations has been 
classified more as university mathematics, in part because the subject has 
obvious affinities with topics that are studied in more advanced university 
courses, such as differential equations, integral equations, and the theory of 
representations of groups. So it is not surprising that many of the early ex- 
amples of functional equations can be found on the William Lowell Putnam 
Competition. Many of these problems are only accessible to students with 
some background in calculus and differential equations, which for the pur- 
poses of this book, we have split off as a separate discipline. For example, 
Problem A3 from the third Putnam competition in 1940 asked for all func- 


tions f(a) such that 
[irorrae=|f rear| 


when constants of integration are suitably chosen. Here, we choose not to 
classify this problem as functional equations because the equation involves 
calculus. 

The very first problem on the Putnam competition where a functional 
equation appears is in the fourth Putnam competition held in 1941. 


Example 1.6 (Fourth Putnam Competition, Problem B14(i)). Show that any 
solution f(t) of the functional equation 


fle+y) f(e-y) =[f@P + [F@)P? -1 


(a and y real) is such that 


f"(t) = +m? ft), 


where m > 0 is a constant. We assume the existence and continuity of the 
second derivative. 


The reader will notice the spirit of d’Alembert’s equation runs through this 
example, although the equation is different. This time the problem involves 
a functional equation. However, both the conclusion and the solution involve 
calculus. 
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The first example on the Putnam competition which involves the kinds 
of elementary methods (with no calculus) such as Cauchy used to solve his 
equation and that of d’Alembert is Problem 2 from the morning session of the 
seventh Putnam competition held in 1947. 


Example 1.7 (Seventh Putnam Competition, Problem A2). A real-valued con- 
tinuous function satisfies, for all real x and y, the functional equation 


f (V2? +9?) = fe) fy). 


Prove that 


f(x) = [FI 
We do not consider the solution to this problem until later. However, it should 
come as no surprise that Cauchy’s equation is fundamental in its solution.” 
Thus we defer more discussion until the next chapter. 

This early example illustrates one of the difficulties with such func- 
tional equations as competition problems. The examiners for that year ei- 
ther expected students to recognise—perhaps without proof—the solution 
to Cauchy’s equation, or expected the students to duplicate all or part of 
Cauchy’s methodology. As we show in the next chapter, this methodology is 
by no means difficult to follow. However, it is a mathematically sophisticated 
approach to the problem that is very different in style to the kinds of so- 
lutions to differential equations that the students would have been used to. 
Perhaps it is an act of faith to put a problem like this on a competition for 
students at any level. Fortunately, there are many functional equations which 
can be solved without using the mathematically sophicated style of the so- 
lution to this problem. Equations which can be solved without appealing to 
concepts such as continuity are often fair game for high school mathematics 
competitions. 

Thereafter, functional equations arise sporadically over the years on the 
Putnam. There are polynomial functional equations, in which the class of 
solutions is restricted to polynomials of arbitrary degree. See, for example, 
Problem B7 from the sixteenth competition in 1956. There are also problems 
such as B6 from the fifteenth competition in 1955 which involve a restriction 
on the domain of the function, in this case to positive integers. The task of 
enumerating these problems is not a particularly fruitful exercise, because 
the classification of the problems is often artificial: functional equations de- 
fined on positive integers are often equivalent to problems involving sequences; 
functional equations involving polynomials may have more in common with 


” The statement of this problem illustrates one of the hazards of putting functional 
equations on mathematics competitions. It is easy to leave out the necessary 
regularity required to ensure the conclusion. In this case, the function f(x) = 0 
for all x satisfies the equation. However, for x = 0, the value of fay” = 0° is 
indeterminate. Thus the statement of the problem should rule out this solution. 
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other polynomial problems than with the theory of functional equations, and 
so on. However, there can be no doubt that in 1959 there were two unam- 
biguous functional equations on the Putnam. Problem B3 in 1963 involves 
differentiability. Because the equations themselves do not involve derivatives, 
we may safely classify it as a functional equations problem and not a problem 
in differential equations. 

The following example is question number 5 from the 1972 International 
Mathematical Olympiad, and illustrates how the historical work on functional 
equations has influenced competitive mathematical problem solving. 


Example 1.8. Let f and g be real-valued functions defined for all real values 
of x and y, and satisfying the equation 


fa+y) + f(a —y) = 2 f(x) gy) (1.24) 


for all x,y. Prove that if f is not identically zero, and if |f(x)| < 1 for all a, 
then |g(y)| <1 for all y. 


Solution. As |f(x)| <1 for all z, it follows that |f| has a least upper bound 
M. That is, there exists a real value M such that |f(x)| < M for all x, and 
for all m < M there is some zo such that |f(2o)| > m. Now suppose that 
there exists a point yo such that |g(yo)| > 1. We prove a contradiction. Using 
(1.24) and the triangle inequality, we get 


2|f(x)| |9(yo)| = |f(x + yo) + f(x — yo)| 
< |f(z + yo) + [f(x — yo)| 
<2M 


So 


This is a contradiction, because M has been defined as the least upper bound 


of | f(«)|. 


The functional equations (1.24) and (1.9) are special cases of a more general 
functional equation, namely 


f(e+y) + f(a —y) = g(a) hy). (1.25) 


Equations of this kind were of interest to applied mathematicians of the eigh- 
teenth century. The right-hand side denotes the situation where the two vari- 
ables x and y “separate” into the factors g(x) and h(y) as shown. 

Earlier, we looked at the work of Charles Babbage, and his results on 
iterative roots. Problems involving iterative roots form an interesting class 
of functional equations that commonly arise in mathematics competitions. A 
typical form for such a problem is the following. 
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Example 1.9 (Putnam 1979, Problem A2). Establish necessary and sufficient 
conditions on the constant k for the existence of a continuous, real-valued 
function f(x) satisfying 

f(f(@)) = kx? (1.26) 


for all real x. 


We do not solve this problem here, but postpone consideration of such exam- 
ples to Chapter 3, where we consider iterative roots in greater detail. More 
common examples of iterative roots occur where function composition using 
f(x) appears on both sides of the equation. The example given above is no- 
table because it is of the simple form that Babbage considered. 

Many functional equations involving iterative square roots are closely re- 
lated to particular recursions. Consider, for example, the functional equation 


f[f(@)| = «+ flex). (1.27) 


Starting at any value x, we can construct a sequence x, f(x), f[f(x)], ... by 
successive application of the function f. Letting f; be the ith value in this 
sequence, we see that the fis satisfy the recursion 


fite = fit fis. (1.28) 


In particular, the famous Fibonacci numbers 


fA=1, fo=l, fs=2, fa=3, fs=5, fe=8, fr=13,... 


satisfy this recursion. However other sequences satisfying (1.28) can be chosen 
by arbitrarily specifying f; and fo. 

Now such recursions are well studied. For example, it is well known that 
the general form of the solution to (1.28) is given by 


fr = a (54) +b (54) (1.29) 


where the constants a and b can be chosen arbitrarily so to determine the 
starting values f,; and fo. For example, we can obtain Binet’s formula for 
the Fibonacci sequence by setting a = 1/5 and b = —1/V/5. Other values 
of a and b give generalized versions of the Fibonacci sequence with different 
starting values. The quantity 


_1+V5 
2 


@ 


(1.30) 


is the golden ratio, which can be obtained as the limiting form of the ratio of 
successive terms of the Fibonacci sequence. As @~! = (/5—1)/2, the formula 
for f; can also be written as f; = a&' + b(—&)~*. 
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From (1.29), it is not hard to find a solution to (1.28): it is immediate that 
f(a) = @z is a solution to the functional equation. However, the solution is 
not unique. Another solution is the function f(2) = —®~' a. Are there any 
other functions? 

An example of this kind of functional equation and the issue of uniqueness 
arose as a question on the 1988 Putnam competition. 


Example 1.10 (Putnam 1988, Problem A5). Prove that there exists a unique 
function f defined on the set of positive real numbers such that 


f[f(@)] = 6 — f(a) (1.31) 
and f(x) > 0 for all x > 0. 


Solution. For each x we construct the sequence 


fi=t, fo=f(z),  fs=flf@)], ---. (1.32) 


The sequence satisfies the recursion fi+2 = 6f; — fiz1. Clearly, f; = 0 for all 
i satisfies the recursion. However, this possibility is ruled out by the fact that 
f(x) > 0 for all 2 > 0. Let us look for a solution of the form f; = y’, where 
y # 0. For this to hold, y must be a solution to the characteristic equation of 
the recursion, namely 

y'+y—6=0. (1.33) 


This has the roots y = —3 and y = 2. Checking back, we find that f; = 2° 
and f; = (—3)’ both satisfy the recursion. Now if the sequence {f;} satisfies 
the recursion, then so does any multiple {af;}. Moreover, if {g;} also satisfies 
the recursion, then so does {f; + 9;}. From these two facts we deduce that 


f; = a2? + b(-3)' 


satisfies the recursion for arbitrary real values a and b. Indeed, this is the 
general family of solutions to the recursion. To demonstrate this, note that 
the recursion is completely determined by the arbitrary starting values f; 
and f2. For any choice of f; and f2 we can solve for a and 6 to obtain that 
sequence. 

Now, recall that the original function f(a) is required to be strictly pos- 
itive. However, if b were nonzero, then eventually f; would become negative 
as i becomes large, no matter what value of a is chosen. (Why is this?) So 
this requires that b = 0. Therefore f; = a2‘. This solution to the recursion 
corresponds uniquely to the function f(#) = 2x, which is seen to solve the 
equation. 


1.9 A contribution from Ramanujan 


The theory of nested radicals is closely related to the theory of recursions. So 
it comes as no surprise that some nested radicals can be studied using the 
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methodology of functional equations. In this section, we consider a famous 
example due to Ramanujan. Problems involving the calculation of nested 
radicals occasionally appear in problem-solving journals and on mathemat- 
ics competitions. 

As the name implies, a nested radical is an expression in which a radical 
is contained within one or more other radicals. One of the most famous of all 
such expressions is Francois Viete’s lovely formula 


eB 2 2 
= x x x RT oy 
bov2 Va4V2 oa faa 


We do not prove that the right-hand side converges to a as displayed. How- 
ever, let us consider the infinite nested radical associated with this formula. 
In order for Viete’s formula to converge, the successive factors must get closer 
and closer to one. Therefore 


eee eee 


where the right-hand side is interpreted as the limiting value of the denomi- 
nators of the factors in (1.34). 

Nested radicals such as these can be easily evaluated. However, some others 
cannot be evaluated as easily. For example, in 1911 the Indian mathematician 
Ramanujan posed the problem of evaluating 


Jusayisayrea (1.35) 


Interestingly, this nested radical was to appear in 1966 as Problem A6 on the 
27th Putnam competition, where students were asked to prove that the nested 
radical evaluates to 3. The problem is not a functional equation itself, and the 
official solution did not involve functional equations. However, iterations are 
closely connected to the theory of functional equations. There is a functional 
equation hidden here, which we uncover below. 

The story of Ramanujan and his legendary mathematical abilities has been 
well told in popular literature.® Born in Erode, India, in 1887, Ramanujan 
received a basic education in mathematics. Beyond that, he taught himself 
mathematics from a book by George Shoobridge Carr. The deficiencies of this 
book were arguably the source of some of Ramanujan’s problems in presenting 
his own work to other mathematicians. Be that as it may, Ramanujan’s math- 
ematical talent was not to be suppressed, and he pressed on with his math- 
ematical investigations by studying divergent series and continued fractions 


(1.34) 


® See, for example, the book The Man Who Knew Infinity, by Robert Kanigel, 
published by Scribners, New York, and written for readers with a minimum of 
mathematics necessary to explain some of Ramanujan’s work. 
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in particular. By 1911, Ramanujan had made contacts in the mathematical 
community, and began to publish and solve problems in the Journal of the In- 
dian Mathematical Society, which is where he posed the problem of evaluating 
the nested radical in (1.35), given above. It was typical of such problems that 
they would be published in the Journal, and that solutions would be published 
subsequently upon receipt. However, Ramanujan’s problem, though simple to 
state, remained unsolved until Ramanujan solved the problem himself. His 
clever insight into the problem was through a generalisation of the nested 
radical to the formula 


rtnta=Jart (ntayteyaletn)+ (nba? t (etn) 


(1.36) 
Thus the nested radical posed by Ramanujan is simply the special case where 
xz =2,a=0, and n= 1. Presumably, Ramanujan expected the original nested 
radical to be solved by elementary methods. For example 


3=V/9 
= /14+2V16 


isayi+ave 


and so on. The continuation of this recursion depends upon the ability to 
decompose each perfect square on the extreme right as 


I 


nm=14(n—-1)Jf/(n+1)2. 


Thus Ramanujan’s challenge cannot be said to have required any new math- 
ematics. Nevertheless, the problem went unsolved for a short while. 

Let us look at this nested radical, to see how it relates to the topic of 
this book. The nested radical in (1.35) contains a hidden functional equation. 
Suppose we generalize (1.35) by defining 


fla) = fi tai+ ery (1.37) 


It is not clear that the function f(x) is well defined. However, if it is mean- 
ingful, then squaring both sides gives us the functional equation 


f(a)? =1+2f(2+1). (1.38) 


The reader will recognise that this is an example of a functional equation in 
a single variable, such as Babbage considered. In particular, it is a nonlinear 
functional equation. The use of the word nonlinear is not a reference to the 
nonlinearity of f. Rather, this refers to the fact that the equation involves 
[f(x)]?, and so cannot be written as a linear expression in f(x) and f(a +1). 
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The reader should not be under any misapprehensions about the function 
f(x). While the functional equation looks complex, it has a very simple solu- 
tion. Looking at Ramanujan’s formula in (1.38) we see that f(a) = «+1 solves 
the equation. However, we cannot expect this solution to be the only solution 
to our functional equation. So in order to deduce that f(x) = x +1, we will 
have to extract additional information about f(x) from our nested radical 
representation in (1.37). Looking at (1.36), we see that there is a family of 
functional equations here, satisfying 


f(a)? =ae+(n+a)? +a fle+n) 


with solution 
f(z) =a4+n+a. 


We examine the solution to Ramanujan’s problem and the solution to func- 
tional equation (1.38) in Chapter 3. 


1.10 Simultaneous functional equations 


We can obtain functional equations in a single variable easily from functional 
equations in two or more variables. For example, Cauchy’s equation f(a + 
y) = f(x) + f(y) produces the functional equation f(27) = 2 f(x) in a single 
variable when the substitution y = x is made. It is clear that the class of 
solutions of the latter equation is at least as large as the class of solutions of the 
former equation. Indeed, it is easy to check that the equation f(2x) = 2 f(x) 
has nonlinear continuous solutions that do not satisfy Cauchy’s equation. 

By contrast, in the following example, we see that linearity can be char- 
acterized. 


Example 1.11. Let f : R > R satisfy the equations 
f(x?) = [f@)P, (1.39) 
f(a@+1)= f(x) +1. (1.40) 


Prove that f(«) = = for all real x. 


Solution. From (1.39) we see that f(x) > 0 for all a > 0. In addition, by 
mathematical induction on (1.40) we get 


f(@+n)=f(x)+n (1.41) 


for all positive integers n. Substituting 2 — 1 for x in (1.40) and applying 
mathematical induction again, we see that (1.41) holds for all integers n and 
all real x. 

Now applying (1.39) we get [f(a —1)]? = [f(1 — x)]?. Using (1.41) on this 
identity yields 
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[1+ f(—#)* = [f(x) - 1°. 
We can expand this, noting in passing that [f(—«)]? = [f(«)]? to obtain the 
fact that 
f(-2) = —f(2) (1.42) 


for all real values of x. Equation (1.42) immediately implies that f(0) = 0. 
So, this fact and (1.41) together imply that f(n) =n for all integers n. 

We are part way there. All we need to do is fill in the gaps and prove this 
equation for noninteger values. As shown in future examples, extending such 
identities to all real values often involves using arguments involving limits or 
continuity. This problem is no exception. 

We next prove a statement that is weaker than the one we want. We show 
that if « € [n,n +1], then f(a) € [n,n+ 1]. Suppose x — n > 0. Then using 
(1.41) we see that f(z) —n = f(a —n) > 0. Thus f(x) > n. Similarly if 
(n+1)—2 > 0, we obtain (n+1)— f(x) = (n+1)4+ f(-—2) = f(n+1-2) > 0. 
Therefore we have proved that f(a) is in the interval [n,n +1] whenever z is. 
But this means that 

|f(a) —a| <1 (1.43) 


for all real x. 
Now set k = 2”, where n is a positive integer. Let x > 0. Then f(x) > 0, 
and 


Therefore 
ln—fla)(<a-Pt. (1.44) 


However, this statement is true for all such k. As k becomes large, 2~*+! goes 


to zero when x > 1. So f(x) = # when x > 1. Using (1.41) we can extend this 
statement to all real x. 


1.11 A clarification of terminology 


Those who work on functional equations speak of equations in one, two or 
more variables. Such an expression is not to be confused with the expression 
functions of one or more variables. For example, Cauchy’s equation in (1.3) 
involves a function of a single real variable. However, it is a functional equation 
in two variables, namely the x and y that appear in (1.3). Although most of 
the equations that we study in this book involve functions of one real variable, 
we shall occasionally consider functions of two or more. See Section 2.9. 


26 1 An historical introduction 
1.12 Existence and uniqueness of solutions 


It is unrealistic to hope that every functional equation has one and only one 
solution. Often there are many solutions to a given functional equation. For 
example, consider the equation f(a +1) = f(#). We can construct infinitely 
many solutions as follows. Let g be any real-valued function defined on the 
interval [0,1). Define f(x) = g(x — [a]), where [x] denotes the greatest integer 
less than or equal to x. Then f satisfies the functional equation because (a + 
1) — [w+ 1] = x— [2]. Similarly, the equation f(f(x)) = « has infinitely many 
solutions, among which is the obvious solution f(x) = x. On the other hand, 
in Chapter 3 we show that the equation f(f(x)) = —2 has no solution if f is 
required to be a continuous function. 

In trying to solve functional equations, extraneous solutions may appear 
in the derivation. So it is important that we go back to our original equation 
and check out our supposed solution by plugging it back in. If a derivation 
shows that only one function could be the solution, and that does not work, 
then we have shown that there can be no solution to the original functional 
equation. On the other hand, we should not stop once we have found a single 
solution. A full understanding of the problem does not come until we know 
that the solution set is complete. 


1.13 Problems 
1. Find all possible choices of real-valued functions f(a) and g(y) such that 


f(z) + g(y) =log(1+a+ay+y) 


for all positive x and y. 


2. Let f(x) be a real-valued function defined for all positive x, satisfying 
f(xt+y) = f(xy) for all positive x,y. Prove that f is a constant function. 


3. Let f be a real-valued function such that, for all real x and y, 


fle+y) fe@—y) = (f(x) fm)P- 


Prove that there exists a function g such that f(x) = g(2?). 
4. Determine all functions f : R — R which satisfy 
f(at+«x)-— f(a—2)=4aer 
for all real a and «. 


5. Determine as many solutions as you can to each of the following functional 
equations: 
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a) f(x) f(a+1) = f(2#+ 1) 

b) f(x) f(e@+1) = 5 [f0) + f2@2+1)] 
c) f(x) fl +1) = fe) — f(@+V 

d) f(x) fl +1) = fif(a) + 2] 


6. Without using Cauchy’s solution to d’Alembert’s equation given in (1.10), 
show directly that the only solution g: R > R to d’Alembert’s equation 


gat+y) + g(x — y) =29(x) gly), 
for all real x and y, that satisfies 


lim g(x) =0 


xL—->Co 


is the function g(x) = 0 for all a. 


7. Consider two continuous functions f,g:R — R, satisfying the functional 
equations 


f(a+y) = f(x) fly) — 9(2) aly) (1.45) 
g(x +y) = g(x) f(y) + o(y) f(a), (1.46) 


where f(0) 4 0. Prove the following properties of f and g. 
a) g(0) =0 and f(0) = 1. 


b) Let 
h(a) = [f(x)]? + [9(@)]- 
Show that h satisfies Cauchy’s exponential equation 
h(a + y) = h(x) ly). 


In addition, show that if f(x) and g(x) are solutions, so are a® f(x) 
and a® g(a), where a > 0. 
c) Suppose there exists a constant K such that 


[f(x)]? + [9(@)]? < K 
for all x. Prove that 
[f(2)]? + [g(x))? =1. 


d) (The final part of this question uses calculus. Those unfamiliar with 
techniques of differentiation may wish to skip this part.) Suppose that 
f and g are differentiable, and that f’(0) = 0. Prove that 


[f(2)]? + [9(@))? = 1. 
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10. 


11. 


12. 
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. Prove that f(n) = 1-7 is the only integer-valued function defined on the 


integers that satisfies the following conditions. 
a) f[f(m)] =n, for all integers n; 

b) f[f(m + 2) + 2] =n, for all integers n; 

c) f(0) =1. 


. x Let f(n) be a function defined on the set of all positive integers and 


having all its values in the same set. Prove that, if 


f(n+1) > fif(n)] 


for each positive integer n, then f(n) =n for all n. 


x The following two questions use techniques that are specific to polyno- 
mial functional equations. 
a) Find all polynomials q(x), with real coefficients, satisfying the equa- 
tion 
q(x") = q(x) q(x + 1) 
for all real x. 
b) Find all nonconstant polynomials p(x), with real coefficients, satisfy- 
ing the equation 
p(2?) = p(x) p(x — 1) 


for all real x. 


x Let f be a real-valued function defined for all real numbers x such that, 
for some positive constant a, the equation 


fata) =5+VF@)—V@r 
holds for all real x. 
a) Prove that f is periodic (i.e., there exists a positive number b such 
that f(a + 6) = f(x) for all x). 
b) For a = 1, give an example of a nonconstant function with the required 
properties. 


* Let f be a continuous function defined on the interval [0,1], such that 
for each x € (0,1) there is an h satisfying O0< a-h<a+h<1 and 


jay — LE= N+ 1+), 


Prove that f(x) = ca-+d for some choice of real numbers c and d. 
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13. Suppose that the functions f: R — R and g:R- R satisfy 
flz+g(y)] =2ce+yH+5. 
Find an explicit expression for g[a + f(y)]. 


14. x Suppose f : R > R is a one-to-one continuous function satisfying the 
following. 
a) f[2a— f(x)] =2, for all real z. 
b) There exists a real number 2 such that f(xo) = Xo. 
Prove that f(x) = a for all real x. 


2 


Functional equations with two variables 


2.1 Cauchy’s equation 


Let us begin by restating and solving Cauchy’s functional equation. Let f : 
R — R be a continuous function satisfying 


fle+y) = f(a) + fy) (2.1) 


for all real x and y. We show that there exists a real number a such that 
f(x) = az for alla ER. 
It is straightforward to show by mathematical induction that (2.1) implies 


f(@1 + 2a +-+++2n) = f(a1) + fae) +--+ + fen) (2.2) 
for all x1, ..., Zn € R. A special case of this is found by setting 7; = --- = 
Ln = x, say. Then (2.2) becomes 

f(na) =n f(a) (2.3) 


for all positive integers n and for all real x. Let « = (m/n)t, where m and n 
are positive integers. Then nz = mt. So 


) 
nf (=2) =m f(t) 
But this can be written as 
F(Z) == re (2.4) 


for all t € R. Thus we have proved that 
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Fig. 2.1. Augustin-Louis Cauchy (1789-1857). 


flat) =a f(t) (2.5) 


for all real t and all positive rationals q. 
What about g = 0? We can extend (2.5) to include gq = 0 in the following 
way. Returning to (2.1), we see that 


f(y) = fy +9) 
= f(y) + f(0). 
So f(0) = 0. From this, we immediately have f(0t) = 0 f(t). Therefore (2.5) 
is true for all nonnegative rational q. 
What about q < 0? Once again, returning to (2.1), we obtain 


= f(g) + f(-@). 


Therefore f(—q) = —f(q). From this fact, we get for q < 0, 


flat) = fl-(-a@) 4 
=— fl(-q)#] 
= —(-@) f(t) 
= qf(t). 
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Therefore, 
flat) =a f(t) (2.6) 
for all real values of ¢ and all rational values of gq. 
Now suppose we substitute t = 1 into (2.6). Letting f(1) = a, we deduce 
that f(q) = aq for all rational numbers g. We can summarize what has been 
discovered so far in the following proposition. 


Proposition 2.1. Let f: R- R satisfy Cauchy’s equation 
fa+y) = f(x) + fy) 


for all real values x and y. Then there exists a real number a such that 


f(@=a4q 
for all rational numbers q. 


We are almost there. All we need to do is draw the same conclusion when we 
replace the rational number q by any real number x. To make this leap, we 
finally make use of the assumption that f is a continuous function. Note that 
Proposition 2.1 made no such assumption on f. The following result is just 
the tool for this final step of the argument. 


Proposition 2.2. Suppose that f: R > R and g:R—- R are continuous 
functions such that f(q) = g(q) for all rational numbers q. Then f(x) = g(a) 
for all real numbers «x. 


Proof. This result follows from the fact that any real number can be approx- 
imated arbitrarily closely by rational numbers. For example, we can write x 
with an infinite decimal expansion, and let q; be the rational number obtained 
by terminating the decimal expansion of x after i digits to the right of the 
decimal point. Then 

lim @=2. 

too 
Because f and g are continuous, we can “pass” limits through the functions. 
So 

f(z) = f (jim ai) 


ae.) 


= jim f (4G) - 


Similarly, 
g(x) = Jim g(a). 


However, by assumption the functions f and g agree on all rational numbers. 
So f(a) = g(a) for alli = 1, 2, .... It follows that f(x) = g(x) for all real «x. 


Putting Propositions 2.1 and 2.2 together gives us the result we need. 
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Theorem 2.3. Let f: R > R be a continuous function satisfying Cauchy’s 
equation 


f(at+y) = f(x) + fly) 


for all real values x and y. Then there exists a real number a such that 
f(a) =a2 
for all real numbers x. 


Proof. From Proposition 2.1, we see that there exists a real number a such 
that f(q) = aq for all rational numbers gq. But f(x) and g(x) = ax are both 
continuous functions. By Proposition 2.2, f(a) = g(a) for all real values x. 


Theorem 2.3 has a number of variants. For example, Problem 1 at the 
end of the chapter asks the reader to solve Cauchy’s equation when the do- 
main of f is restricted to be the nonnegative real numbers. A less immediate 
generalization is the following. 


Theorem 2.4. Let f: R—- R satisfy Cauchy’s equation. Suppose in addition 
that there exists some interval [c,d] of real numbers, where c < d, such that f 
is bounded below on [c,d]. In other words, there exists a real number A such 
that f(a) > A for alle <a <d. Then there exists a real number a such that 
f(x) = ax for all real numbers x. 


Problem 2 at the end of the chapter takes the reader through the steps of 
the proof. The assumption on f in Theorem 2.4 may not look as natural as 
the continuity assumption in Theorem 2.3. However, every continuous func- 
tion is bounded on closed finite intervals by the extreme value theorem. So 
the assumption in Theorem 2.4 is weaker. Another class of functions that is 
bounded on finite intervals is the class of monotone functions. We consider 
the consequences of this in Section 2.2. 


Example 2.5. If we ask for all solutions to Cauchy’s equation when there is no 
boundedness or continuity condition imposed, then many unusual functions 
other than f(x) = az will be found to satisfy the equation. The proof of this 
fact is beyond the scope of this discussion here. However, the basic idea is 
not too difficult. Consider all real numbers x of the form p+ qV2 where p 
and q are rational numbers. It can be seen that any such number is uniquely 
represented in the form p + qV2 in the sense that if 


ptav2=pt+eve, 


where p1, po and q1, q2 are rational, then p, = pz and q, = q. (Why?) We 
write the set of all such numbers as Q[V2]. Note that Q[./2] is closed under 
addition: if x and y are in Q[V2] then so is x + y. 

Define a function f : Q[V2| > R by 
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f(p+av2)=ap + ba, (2.7) 


where av/2 # b. The fact that elements of Q[\/2] have a unique representation 
shows that this equation makes f a well-defined function. Now f satisfies a 
restricted form of Cauchy’s equation: f(x + y) = f(x) + f(y) for all x and y 
in Q[V2]. Nevertheless, f is not the restriction of a linear function to the set 
Q{v2]. 

In order to find a function f : R — R that satisfies Cauchy’s equation but 
is not linear, we need to look for an extension of the function f above from 
Q/V2] to the entire real line. That is where all the hard work is. Hamel [1905] 
proved that there exist subsets H of real numbers such that any real number 
x can be uniquely written as a finite linear combination of elements of H with 
rational coefficients!! The reader who wishes to explore this subject further is 
referred to the appendix of this book on Hamel bases. 


2.2 Applications of Cauchy’s equation 


As we noted in the previous section, any continuous function f : R —> R is 
bounded (both above and below) on a closed finite interval, and the bounded- 
ness condition of Theorem 2.4 is more general than the continuity condition 
of Theorem 2.3. However, continuity and boundedness are sometimes hard 
conditions to prove.” In this section, we consider two results which are similar 
to Theorems 2.3 and 2.4. These results are not as powerful because the con- 
ditions imposed are rather strong. However, in some cases, it may be easier 
to check the conditions than those of our theorems in the previous section. 

In the first of these applications, we replace the continuity condition of 
Theorem 2.3 with an assumption of monotonicity. We show that the conclusion 
still follows. 


Proposition 2.6. Suppose f: R > R satisfies Cauchy’s equation 


f(at+y) = f(x) + fly) 


for all x and y, and is also monotone increasing in the sense that 


f(a) < fly) foralla<y. 


Then f(x) =ax for somea> 0. 


First proof. Of course, monotone increasing can be replaced by monotone 
decreasing with much the same conclusion except that a < 0. There are two 


' In our setting restricted to Q[V/2], above, the elements of Q[V/2] can be written 
as a finite linear combination of elements of {1, /2} with rational coefficients. 

? Even Cauchy had some trouble proving continuity when he was proving his version 
of the binomial theorem. 
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ways to prove this proposition. The first, and much longer way is to start 
from Proposition 2.1 which shows that f(q) = aq, a > 0 for all rational 
numbers q. After that, the monotonicity assumption can be invoked much as 
we invoked continuity to extend the statement from the rational numbers to 
the real numbers. For any real number x, we can write 


aq=f(q) sf) <fr)=ar 


for all rational numbers q, r such that q < x < r. Now consider a sequence of 
rational numbers q; < gz <--: < x, and another sequence of rational numbers 
rT, > 12 >---> 2x such that 


lim gq, = lim rn = 2. 
noo noo 


Then 


ax= lim aqr 
noo 


< lim f(rn) (monotonicity!) 


n> oCo 


So by the Squeeze Theorem, we conclude that f(x) = az. 


Second proof. However, a much easier proof follows directly from Theorem 2.4. 
Because the function f is monotone increasing, it follows that for any interval 
[c, d] of real numbers, we can set A = f(c). Then 


f(z) >A foralle<a<d. 


Thus the function satisfies the conditions of Theorem 2.4. We can conclude 
that f(z) = aa for some real number a. We can see that a > 0. This follows 
immediately from the fact that f is a monotone increasing function. 


Proposition 2.7. Suppose f: R > R satisfies the pair of equations 
fle+y) = f(x) + fy) (2.8) 
f(xy) = F(@) FY) (2.9) 
for all x and y. Then either 


f(z)=O0forallx or f(x)=~<a forall a. 


Note that the function has no continuity, boundedness, or other assumption 
imposed other than the two equations above. Once again, there are two ways 
to prove this. In this case, both are fairly easy. First, let us note that for any 
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number z > 0 there exists a w > 0 such that z = w?. Let us set « = w and 
y = w in equation (2.9), to observe that f(z) = [f(w)]? > 0. We can conclude 
that 

f(z) > 0 for all z > 0. (2.10) 


First proof. For our first method of proof, let us suppose that x < y. If we 
replace y by y — x in equation (2.8), we obtain 


f(y) = fle) + Fly— 2) 2 f(2), 


using property (2.10) to establish that f(y—«) > 0. It follows that f: R= R 
must be a monotone increasing function as in Proposition 2.6, above. Applying 
that result, we conclude that f(#) = az for some real number a. If we insert 
this formula into (2.9) using « = y = 1, we obtain a = a”. This equation in 
a has two solutions: a = 0 or a = 1. Plugging these solutions back in to (2.8) 
and (2.9), we see that they both work. 


Second proof. The second method of proof also uses (2.10), but in a more 
direct manner. It follows from (2.10) that for all 0 < c < d, we have f(x) >0 
for any « € [c, d]. Thus the function is bounded below on [c, d] by A = 0 as 
in Theorem 2.4. So we can conclude that f(a) = ax for some choice of real 
number a. The argument that a = 0 or a = 1 follows as in the first proof. 


2.3 Jensen’s equation 


Jensen’s equation has the form 


F (“*") _ f(z) +f) (211) 


and can be thought of as a version of Cauchy’s equation using averages. Once 
again, the function f is usually assumed to be continuous. For simplicity we 
assume that the domain of f is the entire real line. 

The solution to the equation is easily obtained from the results of the 
previous section. Let g(x) = f(«) — f(0). Then 


9 cry a g(x) + g(y) (2.12) 
2 2 
Moreover, g is a continuous function with g(0) = 0. Setting y = 0 in (2.12), 
we obtain (x) 
x g(x 
=) = 2.13 
7 @ 2 ats) 


As (2.13) holds true for all x, we can substitute « + y for x to obtain 
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o(2*) = er 


Substituting this back into the left-hand side of (2.12) reduces (2.12) to the 
equation 
g(x + y) = g(x) + gly) 


which is Cauchy’s equation. Therefore, there exists a constant a such that 
g(x) = ax for all real x. Letting f(0) = b, we see that the general form for f 
must be 

f(z) =axr+b. (2.14) 


2.4 Linear functional equation 


A plausible extension of both Cauchy’s and Jensen’s equations is 


fiax+by+c)=pf(x)+qfly)t+r. (2.15) 


For which choices of constants a, b, c, p, g, and r does there exist a continuous 
function f : R —> R satisfying (2.15), and what is the general form of the 
solution? 

The reader who wishes a challenge can solve this problem directly, finding 
thereby that the class of functions is not particularly rich. The reader who 
wants a few hints to get started can look at Problem 11 at the end of the 
chapter. 


2.5 Cauchy’s exponential equation 


Cauchy’s exponential equation has the form 


fle+y) = f(2) fy), (2.16) 


where f : R — R is assumed to be continuous and not identically zero. 
We encountered this equation in Chapter 1, where it arose as the functional 
equation used in proving the general form of the binomial theorem. 

Suppose there exists a value xo such that f(ag) = 0. Cauchy’s exponential 
equation implies that 


f(x) = f(a — 20) + xo] 
= f(a — x0) f(xo) 
=0 
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So any function which satisfies (2.16) and vanishes at a point must vanish 

everywhere. As f is assumed to be some function other than the function 

f =0, we see that we have a contradiction. So f(x) 4 0 for all real values «. 
In addition, (2.16) implies that 


f(z) = ly (5)) (2.17) 


As the right-hand side is strictly positive, we conclude that f(a) > 0 for all 
real x. 

Having established this, the rest is easy. As f is strictly positive, we can 
take logarithms. Define g(x) = log f(a). Then g is continuous and satisfies 
Cauchy’s equation. So g(x) = ax for some real value a. We conclude that 
there exists a real number b > 0 such that f(x) = b” for all real x. 

Before closing this section, it is worth mentioning one final equation of 
this type studied by Cauchy. This is the equation 


f(xy) = f(a) fy), (2.18) 


where x and y are positive real numbers. Letting z = logx and w = logy we 
see that this equation reduces to 


g(z + w) = g(z) g(w), (2.19) 


where f(x) = g(logx). But this is Cauchy’s exponential equation. If f(a) is 
continuous for all positive x, then g(z) must be continuous for all real z. We 
can conclude that either f is a power function of the form f(x) = x* for some 
real number k, or is equal to zero everywhere. Power functions of this kind 
have a role to play when we consider Euler’s equation in Section 2.10. 


2.6 Pexider’s equation 


Pexider’s equation has the form 


f(at+y) = g(x) + hy). (2.20) 


We are required to find all continuous functions f,g,h : R — R satisfying 
(2.20) for all real values of x and y. 

Pexider’s equation is a natural generalization of Cauchy’s equation. How- 
ever, as we show, it does not lead to a much richer class of solutions. Put 
y = 0 in (2.20), and set h(0) = b. Then 


g(x) = fe) —b. (2.21) 
Similarly, putting « = 0 and g(0) = a, we have 


hy) = f(y) —a. (2.22) 
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So (2.20) becomes 


fa+y) = f(z) + fy)-a-6. (2.23) 
Now let fo(z) = f(z) — a— for all real z. Then fo satisfies 
fo(@ + y) = fo(2) + foly) » (2.24) 


which is Cauchy’s equation. The continuity of f implies the continuity of fo. 
So solving Cauchy’s equation for fo we obtain fo(z) = cz for some real value 
c. So our solution to Pexider’s equation in (2.20) is 


f(z) =cz+atb (2.25) 
g(x) =cut+a (2.26) 
Aly) =cy+b, (2.27) 


where a, b,c are arbitrary. 


2.7 Vincze’s equation 


The next equation that we consider is a generalization of Pexider’s equation. 
Suppose we wish to find all solutions in functions f, g, h and k to the equation 


f(a + y) = g(x) k(y) + Aly) (2.28) 


for all real x and y. Once again, we impose an appropriate condition on the 
functions such as continuity. See Vincze [1962]. 

Let &(0) = a and h(0) = b. Now if a = 0, then f is a constant function. 
We dispense with this case henceforth, and assume that a 4 0. Putting y = 0 
in (2.28) we get 


g(x) = [f(x) — b]/a. (2.29) 
So we can rewrite (2.28) as 
fle+y) = oy) fl) +¥), (2.30) 
where 
o(y) = k(y)/a (2.31) 
and 
vy) = Aly) - ey) (2.32) 


With these substitutions, we have $(0) = 1 and ~(0) = 0. 
Now, let us define the function 


x(y) = f(y) — F(0). (2.33) 
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Then (2.30) can be rewritten as 


x(x + y) = o(y) x(x) + x(y)- (2.34) 


However, addition is commutative. Therefore, 


b(y) x(x) + x(y) = x(@+y) = x(yt 2) = O(z) x(y) + x(z)- (2.35) 
So we obtain 
[o(y) — 1] x(x) = [d(x) — Ixy). (2.36) 


This equation can be solved by breaking it into two cases. 


First Case: é(y) = 1 for all y. In this case, from (2.34) we see that y satisfies 
Cauchy’s equation y(a + y) = x(x) + x(y). So there exists a constant d such 
that y(x) = dz for all x. From (2.33) we determine that 


f(z) =date, (2.37) 


where c = f(0). Plugging (2.37) into (2.30) and using the fact that ¢ = 1 we 
can determine that 


vy) = dy. (2.38) 
From these solutions, we can obtain expressions for our original unknown 
functions, namely 


f(a) =dx+c (2.39) 
k(y) =a for all y (2.40) 
g(x) = (da +c-—b)/a (2.41) 
h(y) =dy+b (2.42) 


Second Case: (yo) 4 1 for some yo. Set y = yo in equation (2.36). In addition, 
set 


x(Yo)  _ 
a Pa (2.43) 
say. Then (2.36) becomes 
x(z) = 5[6(x) =). (2.44) 


Now we have excluded the case where f is a constant function from consid- 
eration here. This implies that y is not the zero function, and therefore that 
s #0. Let us put equation (2.44) into equation (2.34). We obtain 


s[d(a+y) — I = sy) [e@) - 1) + sey) - J. (2.45) 


As s £0, we can cancel s and reduce the equation to 
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oa + y) = O(x) Oy) (2.46) 


which is Cauchy’s exponential equation. 
So there must exist a positive constant t such that 


oly) =. (2.47) 


From this solution for ¢ we can obtain a solution to f using equations (2.44) 
and (2.33). We must have 


f(x) = st” +c. (2.48) 
Then using (2.30) we see that 7 can be written as 
vy) =c[1-t"]. (2.49) 


Having found solutions for f, ¢ and w we can finally write out solutions in 
terms of our original functions. We get 


f(z) =st* +c (2.50) 
k(y) =at¥ (2.51) 
g(x) = (st® +c—))/a (2.52) 
h(y) =c+(b-—c)t®. (2.53) 


Having solved Vincze’s equation, the reader may naturally wish to try to 
solve an equation of the form 


f(x + y) = g(x) k(y) + j(a) hy). (2.54) 


However, we have reached the limit for which this kind of generality is prof- 
itable. Equation (2.54) contains many interesting special cases, but unfortu- 
nately there are too many solutions to the equation. 


2.8 Cauchy’s inequality 


Just for a change of pace let us try a functional inequality known as Cauchy’s 
inequality.? Let us consider all solutions to 


fle+y) S f(a) + fy) (2.55) 


for all real values of x and y. Now it turns out—not unexpectedly—that there 
are too many solutions to this inequality for us to specify a particular family 
of functions as the general solution. However, let us include an additional 
inequality on the function, namely 


3 T know that this is a book about equations. But this one is too good to miss! 
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f(z) <x (2.56) 


for all real x. What solutions exist satisfying Cauchy’s inequality and this 
additional constraint? 
First, we set + = y = 0 in equation (2.55) to obtain 


FO) <2f0) or — f(0) 20. (2.57) 


Because we also have f(0) < 0 by (2.56), it follows that f(0) = 0. Next, set 
y = —& in (2.55). We get 


f(0) < f(@) + f(-2) 


or 
f(ey2—f(-2) (2.58) 
using f(0) = 0. Combining (2.56) with (2.58) we have 


f(x) 2 -f(-2) 2 -(-2) =2. 


Combined with the inequality in (2.56), we can conclude that f(x) = x. 


2.9 Equations involving functions of two variables 


Another way to extend Cauchy’s equation is to consider functions of two 
or more variables. In this section, we consider the two-variable case. The 
extension to more variables is routine. 

Suppose f(z,y) is a real-valued function of two real variables x and y, 
which satisfies 


f(t. + 22, y) as f(x1,y) Te f(x2,y) (2.59) 
f(x, y1 + y2) = f(x, y1) T f(x, ya) (2.60) 


for all real x, y, 41, £2, y1, and yo. In order to solve this equation, we also 
impose a continuity condition on f. It is sufficient to require that f be a 
continuous function of two variables. Although this assumption is a natural 
one, it turns out to be stronger than we need. In fact, it is sufficient to demand 
that f be continuous in each of its variables, a weaker condition, that says 
that if we fix x, then f(x,y) is continuous in y and vice versa. 

Treating y as fixed in (2.59), we can apply Theorem 2.3 to equation (2.59). 
From that theorem, we see that f can be written in the form 


f(x,y) =cly) x 


for all real x and y. Plugging this expression into (2.60), we get 


c(y1 + yo) @ = c(y1) & + c(y2) x (2.61) 
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for all x, y1, and yg. Setting « = 1, we obtain Cauchy’s equation again. As 
f(a, y) is continuous in the variable y, it follows that c is a continuous function. 
So we can conclude that there exists some real number cp such that c(y) = co y 
for all y. From this we can conclude that 


f(@,y) = cory (2.62) 


for all real x and y. 


2.10 Euler’s equation 


In this section, we continue our investigation of functions of two variables by 
considering a family of functional equations associated with the property of 
homogeneity. Let k be any real number. For given k, the equation 


f(tx,ty) =t* f(a,y) (2.63) 


for all positive x, y, and t, is called Euler’s equation. A function f(x) satisfying 
Euler’s equation is said to be a homogeneous function of degree k. For example, 


the function a 
Try 
flt,y) => 


is homogeneous of degree one, the function 


x 


F(2; y) eel 
y 
has degree zero, and the function 
f(a,y) =a? +2y?+7ary 


has degree two. 

How can we find the class of all functions which satisfy Euler’s equation 
for a given choice of k? This looks difficult, but is really quite easy. Let us 
substitute t = x into (2.63). It then becomes 


f(x,y) =a" file ty). (2.64) 


Suppose that we let g(z) = f(1, z). We can write 


y) (2.65) 


to obtain the solution to Euler’s equation. It can be seen that any function g 
defined on the positive real axis determines a homogeneous function of degree 
k, and vice versa. 

The simplicity of (2.65) suggests that it might be reasonable to generalize 
Euler’s equation to 


f(x,y) = x* g(a" 
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Fig. 2.2. Jean le Rond d’Alembert (1717-1783). 


f(ta,ty) = h(t) f(@,y), (2.66) 


where the function A(t) is given. However, it turns out that the generality of 
this modified Euler equation is a bit of an illusion here. Problem 7 at the end 
of the chapter asks the reader to show that if h is continuous (or bounded 
below on a closed interval) then h can be written in the form h(t) = ¢*, 
provided that f is not the function which is zero everywhere. 


2.11 D’Alembert’s equation 
It is now time to return to the analysis of D’Alembert’s equation 


g(x +y) + g(x — y) = 29(x) gly) (2.67) 


that was briefly discussed in Section 1.6 of Chapter 1. 

We consider the derivation of the full solution to this equation under the 
assumption that g(x) is continuous. Note first that if g(v) = 0 for all x, then 
g satisfies the equation. From now on, we assume that there is at least one 
value at which g does not vanish. 

Next set y = 0. The equation reduces to 


2 g(x) = 29(x) g(0). (2.68) 
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Assume that x is chosen so that g(a) # 0. Then we can cancel g(x) to conclude 
that g(0) = 1. On the other hand, setting x = 0, equation (2.67) reduces to 


gly) + g(-y) = 2.9(0) g(y) 
= 2g(y) (2.69) 


which implies that g(y) = g(—y) for all y. 
Next, let us set « = y. Using (2.68) we get 


g(2%) +1 = 2[9(x)]?. 


Equivalently, we have 


(ya en 


An equation such as this is already quite promising in suggesting the kind 
of double angle formula satisfied by cos x. However, it is also satisfied by the 
function cosh x. So we must be careful to determine the full set of solutions. 

Because g(x) is assumed to be continuous, and because g(0) = 1, there 
exists some value t > 0 such that g(x) > 0 for all x in the closed interval 
[—t, t]. We now break the argument up into two cases. 


First, we consider the case where 0 < g(t) < 1. So there exists some number 
0<s< 7/2 such that 
g(t) =coss. (2.71) 


From this statement, it is a straightforward argument by mathematical in- 
duction, using (2.70) to show that 


g (sm) = cos (=) (2.72) 


for m = 0, 1, 2, 3, .... To see this, note that the base case for the induction 
with m = 0 is simply (2.71). Suppose the result holds up to m. Then 


Coen)] = FCS) 


_ g(t/2™) +1 
= 2 

PAs 1 
= cos(s/2™) +1 by induction hypothesis 


2 
= cos” ( 


by equation (2.70) 


sat) (2.73) 


Both g(t/2’"*!) and cos (s/2™*') are positive. So we may take square roots 
of both sides to conclude that 


t Ss 
9( saat) = 08 (Sez) 
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as required. This completes the induction step, and the proof of equation 
(2.72) is complete. 
Next, let us return to equation (2.67), and make the substitution x = ny. 
This gives us 
g(r + 1) y] = 29(y) g(ry) — gl(n— 1) gy}. (2.74) 


Equations (2.72) and (2.74) can be used to construct a second argument from 
induction on n that 
t 
g (3) = cos (=) (2.75) 


for all integers m, n > 0. To see this, we note that the base case n = 0 follows 
from the fact that g(0) = 1, and the case n = 1 has already been established 
by the earlier induction argument. Suppose the result holds up to n. Then 


g tee =29(~)9(2)-s i by (2.74) 
fe | =29(an) #(Fe) ~9| ae | 


-1 
= 2 cos (=) cos (=) cos E arn | by induction hyp. 


(2.76) 


This completes the induction step, and equation (2.75) is proved. 
Since both s and ¢ are greater than zero, we can write s = at for some 
a > 0. So equation (2.75) becomes 


g ($) = cos (« z=) (2.77) 


for all n, m > 0. Using the fact that both g and cos are even functions, we 
can extend (2.77) so that 


g(x) = cos (az) 


for all x which can be written in the form « = +n27-™t for some m, n > 0. 
Numbers +n 27~™ are called the dyadic rationals. It is well known, and is not 
proved here, that the set of dyadic rationals is dense. Thus any real number 
can be written as a limit of a sequence of dyadic rationals. Using an argument 
which parallels Proposition 2.2, we see that g(x) = cos(a2) for all real x. 


The second case that needs to be considered is where g(t) > 1. We define 
the hyperbolic cosine function to be 
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= 
puriutriirids 


i=) 
n 
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Fig. 2.3. A few solutions to d’Alembert’s equation. 


L —2x£ 
cosh 7 = — . (2.78) 
It is straightforward to see that there exists some s > 0 such that g(t) = cosh s. 
From here on, the argument in this second case parallels the argument given 
in the first case above. So we omit the details. Note that the hyperbolic cosine 
function coshx satisfies the same sorts of trigonometric identities that the 
circular cosine function cos x satisfies, if allowance is made for the occasional 
change of sign in the formula. The rule for sign changes is known as Osborne’s 
rule: if the trigonometric identity has the product of two hyperbolic sines or 
two hyperbolic tangents, the sign on that term is reversed. For example, the 
summation formula for cos x is 


cos(u+v) = cosu cosv — sinu sinv. 
The summation formula for cosh x is 
cosh (u+ v) = coshu coshv + sinh u sinh v, 


where 


sinh xz = 


Upon completion of this part of the proof, we deduce that g(a) = cosh (ax) 
for some a > 0. 
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Figure 2.3 displays some of the functions that we have found to be solutions 
to d’Alembert’s equation in this section. Plotted are three circular functions 
cosn x, and three hyperbolic functions coshn x, for n = 1, 2, 3. 


2.12 Problems 


1. Let f : R* — R be acontinuous function, satisfying f(a@+y) = f(x)+f(y) 
for all x,y > 0. Prove that f(a) = az for some real number a. 


2. This problem asks the reader to fill in the details of the proof of Theorem 
2.4, above. Let f : R — R be a continuous function satisfying Cauchy’s 
equation. Suppose in addition that there exists some interval [c, d] of real 
numbers, where c < d, such that f is bounded below on [c, d]. 

a) Show that f(na) =n f(x) for all real 2. 

b) Define p = d—c. Show that f is bounded below on the interval [0, p]. 
(However, it need not be bounded below by the same constant as on 
the interval [c, d].) 

c) Define the function 


Prove that g is also bounded below on the interval [0,p| and satisfies 
Cauchy’s equation. 

d) Show that g is periodic with period p in the sense that g(a+p) = g(x) 
for all real x. Conclude from this, and the fact that g is bounded below 
on the interval [0,p| that g is bounded below on the entire real line 
(—00, +00). 

e) Suppose that there exists some 29 for which g(a) # 0. Prove a 
contradiction, by showing that the sequence of values g(n xo), n = 

1,+2,+3,... is not bounded below. 

f) Conclude that g(x) = 0 for all real x, and therefore that f(z) = ax 
for all real x, where a = f(p)/p. 


3. Find all solutions in continuous functions f,g,h : R — R to the functional 
equation 


f(x +y) = g(x) Aly). 


4. Consider continuous functions f : R — R such that 


f(a1) + f(x2) + f (xs) = flyr) + Flye) + f(ys) 


whenever 21 + %2 +23 = yi + y2 + y3 = 0. Prove that there exist real 
numbers a and 0 such that f(z) = aa + 6 for all real z. 
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5. 


10. 
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x Cauchy’s equation on a circle. Let T be the set of real numbers modulo 
one. That is, = y (mod 1) if and only if x — y is an integer. (We can 
conveniently think of T as the set of points of the interval [0,1) with the 
right-hand end “wrapped” around to the left-hand end of the interval.) 
Consider a continuous function f :'T — T such that 


f(@+y) = f(a) + f(y) (mod 1). 


Note that a function f is continuous on T if f is continuous on the interval 
(0, 1) and 
lim f(x) = f(0). 


x—1— 


Show that there exists an integer n such that f(#) = na (mod 1) for all 
ve T. 


. State and prove a version of Problem 4 above, for a continuous function 


f from T to T. 


. x Suppose that there exists a function h(t) defined for all positive t and a 


positive function f(x,y) defined for all positive x and y such that 


f(tx,ty) = h(t) f(a, y) (2.79) 


which is the modified Euler equation of Section 2.10 above. Suppose, in 
addition, that h is a continuous function. Show that h(t) = t* for some 
choice of k. 


. Suppose f : R > R has the property that there exists a constant K such 


that 
lf(x) — f(y)| < K (@-y)?. 


Prove that f is a constant function. 


. The property associated with Cauchy’s inequality, namely 


flet+y) < f(z) + fly) 


is known as subadditivy of the function f(x). Suppose we reverse the other 
inequality and assume that f(x) > x for all x. Does the conclusion f(x) = 
x follow in this case? 


* Determine which functions f(a) mapping the rationals to the rationals 
satisfy the functional equation 


fle+ f(y) = F(@) Fy). 


11. 


12. 


13. 


14. 
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In this problem, we attack the linear functional equation of Section 2.4. 
Suppose f : R > R is a continuous function satisfying 


f(ax+byt+c)=pflx)+afy)tr (2.80) 


for some choice of constants a, b, c, p, g, and r. The reader is warned that 
the steps below contain many hints. So the greater challenge is to solve 
the linear functional equation without reading the steps below. 

a) Prove the following equations 


f(0) = p(-e/a) +g f(0) +r (2.81) 
f(u) =f (—*) +4 f(0) +r (2.82) 
f(v) =p f(—e/a) + 4 f(v/b) +r (2.83) 

f(utv) =pf (—*) +qf(v/b) +r (2.84) 


b) Using (2.81)-(2.84), prove that f(u+ v) + f(0) — f(u) — f(v) =0. 

c) Conclude that f must have the form f(x) = sa +4 t for some constants 
s and t. 

d) Show that such a solution will work if and only if p =a and q = b. 


Find all continuous f : R — R such that 
f (Ve +) = Fo) FW) 
for all real x and y. 


Find all continuous functions f such that for all real 7 and y 


fle+y) f@—y) =[F(x) fm). 


x For any real numbers x and y, we define the quasiarithmetic mean of x 
and y with generating function f to be a function of the form 


in(eyy) = f°] 


where f is a continuous strictly increasing or strictly decreasing function. 
For example, the ordinary arithmetic mean is obtained by setting f(a) = « 
and the geometric mean by setting f(x) = log x. Other means that can be 
constructed include the harmonic mean (f(z) = 2~') and the root mean 
square (f(a) = x”). These last three definitions are usually defined only 
for « and y positive. 
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A question of immediate interest is when distinct functions f and g 
give distinct quasiarithmetic means. Suppose that 


p> | PRO) <1 [se ea 


Prove that there exist constants a 4 0 and 6 such that 


g(«) =a f(x) +0. 


In addition, prove the converse. Thus prove that two quasiarithmetic 
means are equivalent if and only if their generating functions are lin- 
early related. 
A quasiarithmetic mean is said to be translative if it has the property 
that 

m(x+t,y+t)=m(a,y) +t 


for all real values x, y, and t. Obviously, the arithmetic mean (f(x) = 
x) is translative. Is it the only one? It turns out that there is another 
family of translative quasiarithmetic means. 

Solve 


s 


- Feros Suen) _ fo oo 4 


to find all possible generating functions f. 
A quasiarithmetic mean is said to be homogeneous if it has the prop- 
erty that 

m(ta,ty) =tm(a,y) 


for all positive values z, y and t. Obviously, the geometric mean 
(f(x) = log x) is homogeneous. Find all others. 


15. « (The following functional equation requires calculus.) Find every twice- 


16. 


a) 
) 


differentiable real-valued function f with domain the set of all real num- 
bers and satisfying the functional equation 


[f(a]? — FM)? = fle +y) fe —y) 


for all real numbers x and y. 


x For each positive integer n, let f, be a real-valued symmetric func- 
tion of n real variables. Suppose that for all n and for all real numbers 
D15-- 


., In; Y, it is true that 
fet Aver BAD) a LAG 5 coy Ce) ee 


b) fn(—%1,.--,—-2n) = —fn(#1,---,2n). 


fnti(fn(21, rdf En) sey fr(©1, Pane ,XLn),Ln41) = fngi(“1, ads ;Ca ene 
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Prove that 
Lyte + ly 


n 


fn(21, ee 5a) => 


(Note that a function f is said to be symmetric if its value is independent 
of the order of its variables: f(x,y) = f(y,x) for two variables, and so 
on.) 
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Functional equations with one variable 


3.1 Introduction 


In this chapter, we consider functional equations involving a single variable. 
We have already considered these equations in Chapter 1, in the work of 
Charles Babbage. It is now time to examine some of these equations in greater 
detail. We begin by considering a useful technique known as linearization, 
which Babbage used to solve for iterative roots. After that, we look at some 
examples of conjugacy equations, which also arise from attempts to linearize 
functions. 


3.2 Linearization 


Sometimes, rather complicated-looking problems can be made quite simple. 
For example, suppose that we are asked to find a solution to the equation 


f(x’) — f(x) =1 (3.1) 


for « > 1. We can tell by looking at this equation that if f is increasing, it 
must do so rather slowly, as squaring x only makes f increase by one! It is 
rather tempting to try to change the problem to a function which increases 
faster. Let F(a) = f(a"), where a > 0. Equivalently F(log, 7) = f(a). Then 
for x > 0 the function F' satisfies 


F(2«) — F(x) =1 (3.2) 


for all x > 0. But this equation reminds us of the properties of a logarithm. 
The function F(x) = log, x satisfies equation (3.2). So 


f(x) = log, log, x (3.3) 


satisfies equation (3.1) for all a > 0. 
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This technique is known as linearization. It can be used to turn some more 
complicated functional equations into simpler ones. In the example above, 
the nonlinearity was in the domain of the function. However, in the following 
equation the nonlinearity is in the range of f. 


f(e+1) = [f(2))?- (3.4) 


We assume that f(a) 4 0 for all z. We can make this equation linear by letting 
F(a) = log, f(x), where, once again, a > 0. (Note that if the equation holds 
true for all x, then f must be strictly positive.) Then the equation becomes 
F(«# +1) =2 F(a), for which a solution F(x) = 2” can easily be found. Thus 
we can conclude that a solution to the original equation is 


f(x) =a”. (3.5) 


More generally, we seek to linearize a functional equation by replacing a 
function f by a function F’, where 


p(F[e(2)]) = f@)- (3.6) 


The functions p and ¢ are specially selected for the purposes of linearizing 
the equation. Unfortunately, many functional equations cannot be simplified 
by linearization techniques. However, it is worth watching carefully for com- 
plexity in an equation that can be eliminated by a simple transformation. 

The following problem is part of a shortlisted question from the 1997 
International Mathematical Olympiad. 


Example 3.1. Do there exist functions f : R > R and g: R- R such that 


flg(@)] =? and g[f(x)] =a" (3.7) 


for all zc € R? 


Solution. For the moment, let us set aside the question of whether f and g 
can be found for all real x. We concentrate on constructing the functions on 
the restricted domain [1, co). 

From the two equations we get 


F(x*) = Fol f(2))) = [F(@)P. (3.8) 


We can linearize the equation f(a*) = [f(2)]? by defining a function F' such 
that 
f(a) =a? Oe?) (3.9) 


where a,b > 0. Making this substitution in (3.8), and letting y = log, x, we 
get 
F(4y) =2 Fy) (3.10) 
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for all x > 0. This is easily solved with a function proportional to F(x) = /z. 
Thus 


f(x) = av! (3.11) 
for all x > 1 is a candidate. Now working the other way, we have 
g(x*) = 9(flg(@)]) = [g(a)}*- (3.12) 
We linearize using 
g(x) = cea *) (3.13) 


as before to obtain the functional equation 

G(2y) =4G(y). (3.14) 
A solution to this equation is G(x) = x. So 

g(a) = clea 2)” (3.15) 


for x > 1 is a candidate for a solution for g. In order to check properly we have 
to go back and plug these functions into the original functional equations to 
see if they work for any choice of a, b, c, and d. Plugging (3.11) and (3.15) 
into (3.7) we see that 


f(x) =2V%2* and g(x) = 16°82)” (3.16) 
is one possible pair of solutions to (3.7). 


It remains to extend f and g to the entire real line. This can be done, so 
as to satisfy (3.7). This is Problem 8 at the end of the chapter. 


3.3 Some basic families of equations 


One of the simplest families of functional equations that we can write has the 
form 


f(x) = fla(@)] (3.17) 


for all real x, where a: R > R is a specified function. What are the restric- 
tions on f if it is to satisfy such an equation? 

If we do not assume that f is continuous, then the complete family of 
solutions is easy to write down. First of all, let us write 


a’ (x) = a(z) and at! (x) = ala” (x)| (3.18) 


for n = 1,2,.... For convenience, we also define a° to be the function a°(x) = 
xz. We call the sequence 


a(x), a(x), a3(x), ... (3.19) 
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the splinter of x. Applying (3.17) a total of n times in succession, we see that 


f(x) = fla” (x)]. (3.20) 


Thus the function f is constant on the splinter of . 
We say that real numbers x and y are equivalent under iteration by a 
provided that there exist two integers n,m > 0 such that 


a"(a) =al(y). 


We write x ~q y when this is true. It is easy to check that ~q is an equivalence 
relation on R. That is, the relation ~, satisfies the following three properties. 


1. Reflexivity: «~, «x for all real x; 
2. Symmetry: « ~q y implies that y ~q z; 
3. Transitivity: c ~q y and y ~q z implies that x ~q z. 


For each real number , let us define the set A,(x) to be the set of all real 
values y for which y ~q x. That is, 


Aa(z) = {ye R:y~a 2}. 


The sets Aq(x) are the equivalence classes for the equivalence relation ~q, 
and are often called the orbits of a. It follows from (3.20) that the function 
f must be constant on each of the orbits of a. So a necessary and sufficient 
condition that f satisfy (3.17) is that we can factor f so that 


f(x) = g[Aa(2)] 
where g is some real-valued function defined on the collection of all orbits. 


Example 3.2. For example, the equation f(«) = f(a + 1) is of this kind with 
a(x) = «+1. In this case, the set A, consists of all y whose fractional part 
equals the fractional part of x. Equivalently, y € A, if and only if « = y (mod 
1). So the class of solutions f to the functional equations are the functions 


f(x) = g(a — [a}) 


where g : R > R is arbitrary. Here, [a] represents the greatest integer less 
than or equal to x. 


An interesting problem is to determine the family of continuous functions f 
which are solutions to (3.17). The assumption of continuity is often strong 
enough to force the solutions to (3.17) to be constant. For example, suppose 
that there exists some real number 2x9 such that 

lim a”(x) = 20 (3.21) 


n—- Co 


for all real x. Then 
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f(to) =f lim a"(2)] : 


n> co 


However, if f is a continuous function, then we can “pull” the limit to the 
outside. So 


f(zo) =f lim a"(2)] 


n> co 


= lim flo"(2)] 


f(x) 


for all x. In other words, f must be constant. Note that this argument assumes 
that xo lies in the domain of f. A completely different situation can arise if 
Zo is not in the domain. 

If the splinter a”(a) has a limit we can determine its value by checking 
for fixed points under iteration by a when a is a continuous function. For 
example, suppose 2p is the limit of a”(x) as in (3.21). Then 


to = lim a”(x) 
n—->co 


=: BR n+1 
a ae 
= Jim ala” (a)| 


=a lim a"(2)] 


n> co 
= a(xo). 


Note that we can “push” the limit through a because the function a@ is con- 
tinuous. Also, note that being a fixed point under iteration by a is a necessary 
but not sufficient condition for the convergence of a” (x). 

In general, if f is a continuous function satisfying (3.17), then f will not 
only be constant on the orbits Ag(x) for each x, but will also be constant 
on the closure of each of these orbits. Rather informally, we can say that the 
closure of any orbit Aq(x) is the set of points in A,(«) together with the set 
of all limit points from sequences chosen with values from Aq(x). So f will be 
a constant function if the closure of Aj(«) and Ag(y) have a point in common 
for any x and y. 

To illustrate these ideas, let us consider problem A6 from the 1996 Putnam 
Competition. 


Example 3.8. Let a be any positive real number. Find, with proof, all contin- 
uous functions f : R + R such that f(x) = f(a? + a), for all real z. 
Solution. First note that 


f(-«) = fl(-2)’ + a] = f(x’ + a) = f(a). (3.22) 


Thus it suffices to determine the value of the function for x > 0, as the 
relation above immediately determines the value of f(a) for negative values 
of x. Henceforth, assume that x > 0. 
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For each 7 we need to examine the limit points of the orbit A,(az) where 
a(x) = 2? +a. (3.23) 
Solving a(ao) = Xo, we observe that we have the following cases: 


1. a = 1/4. In this case, a(x 9) = vp has the unique solution x) = 1/2. If 
0<a< 1/2, then 
u<a(r) < 1/2. 


So in this case, the splinter a” (a) is an increasing sequence bounded above 
by 2g. Therefore, the sequence a"(x) has a limit. As zo is the only fixed 
point of a, the splinter of any x < 1/2 must have x = 1/2 as its limit. 
So the closure of every orbit A(x) will contain xo. It follows that f must 
be constant on the interval [0, 1/2]. 

On the other hand, if « > 1/2, then we also see that a”(z) is an increasing 
sequence, which must be unbounded. So the orbit of any point x > 1/2 is 
the set 


A(x) = {a"(x):n =0,+1,+2,...}, 


where 
a \(z) = Vz—a>2Q. 


Therefore, when x > 1/2, the sequence a~"(a) is a decreasing sequence 
which is bounded below by zo. For reasons similar to the previous case, 
this sequence must converge to xo. Thus we can conclude that the closure 
of this orbit contains the point 29 = 1/2 as well. So f must be constant on 
the interval [1/2,00). It follows that f is constant on the interval [0, 00). 
As f(—x) = f(a), the function f is constant on the entire real line. 

2.a < 1/4. When a is in this range, the equation x? — x + a = 0 has two 
distinct solutions, say 


xr, = (1- V1 —4a)/2 and Lo = (1+ V1 —-4a)/2. 


If « < x, the splinter a”(x) is an increasing sequence (although not nec- 
essarily strictly) with limit point x1. So 2 is in the closure of A(z). 
Additionally, if 71 < x < 22, then the splinter a”(x) is monotone de- 
creasing with limit point x;. Therefore f must be constant on the interval 
(0, x9] . 
Similarly, if 2 > x2, the sequence a~"(x) is decreasing with limit point 
x. It follows that f is constant on the interval [22, 00). 
Putting these facts together and using f(—«) = f(x) we conclude that f 
is constant on the entire real line. 

3. a > 1/4. In this case, a has no fixed points. For every real «x, the splinter 
a(x) is a strictly increasing sequence that is not bounded above. More- 


over, the function 
a(x) = /z—a 
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is only defined if x > a. So the orbit A(x) of any point x contains no 
limit points. 

To find all functions f we divide the nonnegative real numbers into the 
intervals 


[0, «(0)], [a(0), 07(0)], [o*(0), a*(0)), 


and so on. Let g be any continuous function on [0,a(0)] such that g(0) = 
g(a(0)). Now the reader can check that, for any positive x, there exists a 
unique y € [0,a(0)) and a nonnegative integer n such that z = a"(y). We 
define f in such a way that f(x) = g(y). We extend f to negative x by 
defining f(«) = f(—x) when x < 0. 

It can be checked that any f so defined satisfies the functional equation 
and is continuous. Furthermore, any continuous solution to the functional 
equation can be represented in this form. 


This problem is a reminder to us that unless we can enforce some strong 
condition such as constancy, we are likely to find that a functional equation in 
a single variable has infinitely many solutions. The standard way to enforce a 
unique solution is to impose additional constraints on the function. In some 
cases, those additional constraints can be in the form of additional functional 
equations. 

An interesting exception to this rule occurs when a functional equation 
involves an iterative root of the identity, also known as a cyclic function. 
Suppose there exists some positive integer n such that a" (x) = x. Clearly, an 
involution is a special case where n = 2. The reader will recall functions of 
this sort from Chapter 1, where we reviewed the work of Charles Babbage. 
In Chapter 1, it was the unknown function f which was presumed to be an 
iterative root. In the present case, the function a(x) given in the equation is 
the iterative root. Because a(x) is cyclic, the splinter is finite. The following 
example illustrates a method for solving functional equations such as this. 


Example 3.4. Consider, for example, the equation 


2F(e) + fe) =e, 


where z # 0. We replace x by x~! in this equation to obtain a second equation. 
Letting a(x) = «~' we note that a?(x) = 2. So we also have 


2 f(a) + f(x) =a7?. 


Multiplying the first equation by 2, and subtracting the second equation from 
the first, we get 
207-1 
Nae 


It can be checked that this satisfies the original equation. 
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Example 3.5. More generally, we may suppose we have an equation involving 
the terms f(aJ(x)), where a(x) = x. Suppose that the equation is linear in 
the n unknowns 


f(x), F(a(z)), ..., f(a®“*(a)). 


By successively substituting a/(x), 7 < n for x in the original equation, we 
obtain a set of n equations in the n unknowns f(a/(x)), 0 <j <n. We can 
solve for the n unknowns using standard methods of linear algebra. 


Problem 11 at the end of the chapter is of this kind, and appeared as Problem 
B2 on the 1971 Putnam competition. 


3.4 A menagerie of conjugacy equations 


The form of equation (3.17) precludes the possibility that there are any one- 
to-one functions which are solutions to it. However, (3.17) has a number of 
generalizations which can have one-to-one solutions, and which are sufficiently 
important to be given special names. 

The equation 


fla(a)] = s f(x) (3.24) 


is called Schréder’s equation. For a given function a, we are required to find 
one or more functions f and a real number s # 1 such that (3.24) holds. 

We have to be a bit careful about the domain of definition of any solution 
to (3.24). For example, suppose there exists a fixed point of the function a. 
If zo is such a value, then making the substitution a(ao) = xo in Schréder’s 
equation leads us to the identity f(%o) = s f(ao). As s £1, we must either 
conclude that f(ao) = 0 or that xo is outside the natural domain of definition 
of the function f. The latter is quite common in solving Schréder’s equation. 
To avoid this problem, we can assume that the equation a(x) = «x has no 
solutions on some interval on which a function f(x) is to be constructed. A 
common assumption is that 0 < a(x) < x for all x in some interval (0, a) 
where 0 < a < oo, and 0 < s < 1. These assumptions are not as restrictive as 
they look, because the equation can often be transformed (e.g., by substituting 
x =a'(y), etc.) to an equation of this kind. If we can find a positive solution 
to (3.24) with any choice of 0 < s < 1, then we can easily find solutions for 
other values of s. For example, if f(x) solves (3.24), then [f(«)]? solves (3.24) 
with s replaced by s?. 

A kind of inverse equation to Schréder’s equation is obtained as follows. 
Let f be a solution to (3.24) which admits an inverse: a function g = f~! 
such that f(a) = y if and only if g(y) = x. Then g satisfies the equation 


g(s@) = alg(@)). (3.25) 


Equation (3.25) is referred to as Poincaré’s equation. 
The equation 
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Fig. 3.1. Niels Henrik Abel (1802-1829) 


fla(x)] = f(x) +a (3.26) 


is called Abel’s equation. The real number a is assumed to be nonzero, and is 
either given or to be determined along with f. As with Schréder’s equation, we 
must once again be cautious about the domain of definition of f. If we choose 
a value xo such that a(x) = Zo, then we see that x9 cannot lie in the domain 
of f. Otherwise we would obtain the equation f(@o) = f(zo) + a, which is 
clearly impossible if a is assumed to be a nonzero value. So the restriction of 
the domain of definition of f(a) to values x such that a(x) 4 x is required. 
The equation 


fla(w)| = [F(a (3.27) 


where p 4 1, is B6ttcher’s equation. For this equation, we are interested in 
nonnegative functions f(x). 


One additional equation worthy of note is the commutativity equation de- 
fined by 


fla(x)] = a[f(x)]. (3.28) 
For example, Problem 1.11 in Chapter 1 involved two simultaneous commu- 
tativity equations. 
All the equations that we have considered above are special cases of a 
family of equations called conjugacy equations, which take the form 


fla(x)] = SIf(a)] (3.29) 
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for appropriately chosen functions a and @. For example, when a = ( we 
have the commutativity equation, when G(x) = sx we have Schréder’s equa- 
tion, and so on. We do not attempt to solve conjugacy equations in their full 
generality below. However, we do look for solutions to their special cases. 

When (3.29) is satisfied by some one-to-one function f, we say that a and 
GB are conjugates of each other. When Schréder’s or Abel’s equations can be 
solved in this way, then the function a is shown to be conjugate to a linear 
function. Typically, linear functions are easier to study than nonlinear ones. 
So it is very important in mathematical sciences to be able to establish that a 
nonlinear function is conjugate to a linear one. When this is the case, we are 
able to linearize complex mathematical relationships and study them using 
linear methods. For example, Abel’s equation 


flo(«)] =«+a 


uses a function f to establish a conjugacy between a given transformation 
x + a(x) and a simple shift transformation 4 «+a. As the latter trans- 
formation is particularly simple, we are often able to solve problems with the 
shift transformation and then use the conjugacy to “transfer” the solution 
back to a. If a is a nonlinear transformation, then Abel’s equation, and its 
solution f have the effect of linearizing x ++ a(x). Of course, the solution 
f to Schréder’s equation also linearizes x ++ a(x) by establishing a conju- 
gacy between this transformation and the scale change x +} sa. Which of the 
two linearizations is more appropriate, if either, depends upon the particular 
character of a. 

This idea of using conjugacy between two functions was essential to Bab- 
bage’s solution to the problem of finding iterative roots: the iterative root of 
a shift transformation or a scale change is easy to define. We use conjugacy to 
transfer such solutions to more complicated functions. We return to consider 
iterative roots in Section 3.7. 


3.5 Finding solutions for conjugacy equations 


3.5.1 The Koenigs algorithm for Schréder’s equation 
First note that if f(x) is any solution to Schréder’s equation 


fla(x)] = s f(x) 


then so is any constant multiple of f(x). 

If the splinter of a behaves approximately as a geometric sequence, a 
solution can be found. The splinter a” (x) is said to be approximately geometric 
if there exists a number s € (0,1) such that 


ee) (3.30) 
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exists, is finite, and nonzero. In this case we say that the splinter has rate s. 
On a domain of values x where the splinter of x is approximately geometric 
with rate s independent of x, a solution to Schréder’s equation (3.24) is given 
by 
a” (a 
f(z) = lim ee) 


noo =g” 


(3.31) 


for the particular choice of s which is a witness to the property (3.30). This 
is easily checked by direct substitution into Schréder’s equation. We have 


fla(x)] = Jim 2) 
n+1 
Tye a 


=s f(a). 


The particular method given in (3.31) for generating a solution is known as 
the Koenigs algorithm. See Problem 6 for an example of a functional equation 
that can be solved using the Koenigs algorithm. 

It is possible to generalize this solution. Provided that the limit 


n+1 
s= lim © (x) 


noo Q” (x) 


(3.32) 


exists for all x on some domain, and is independent of x, then we can check 
that for each xo in that domain 


f(x) = lim a) 


n—0o a” (x9) 


(3.33) 


is a solution to Schréder’s equation for each x9, provided the limit exists. We 
have 


= lim oe) 
fla(x)| = im, a” (x9) 
— tem [OE @) , a2) 
neo] a(x) a(x) 
= lim oP (a) im ae) 
= oo an(a per a(x) 
= 5 f(z) 


This function is called the principal solution to Schréder’s equation. Note that 
the choice of x9 need not bother us unduly here. If we change the value of xo, 
the effect of this is to multiply the principal solution by a constant. So the 
principal solution can be said to be uniquely defined except for an arbitrary 
multiple. 
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3.5.2 The Lévy algorithm for Abel’s equation 

We consider the special case of Abel’s equation where a = 1, so that 
fla(x)] = f(#) +1. 


Note that if f(a) is any solution to Abel’s equation (3.26), then so is f(a) +c¢ 
for any constant c. 

If the function a(x) is approximately geometric, it may be most appro- 
priate to transform Abel’s equation to Schréder’s equation and to find the 
principal solution as outlined in the previous section. Alternatively, it may be 
that the function a(#) behaves approximately as does the function +> «+a. 
In this case, the most convenient formulation of the functional equation is as 
a form of Abel’s equation. 

However, suppose there exists an x9 such that 


=1 (3.34) 


(3.35) 


exists, it is a solution to Abel’s equation f[a(x)] = f(a) +1. To see this, write 


x)| = lim at} (x) — a" (ao) 
Flo(@)] = EB, amFB(a) — (a) 
) 


The limit in the second term is seen to be one by replacing x with a(x) in 
(3.34). Equation (3.35) is known as the Lévy algorithm for Abel’s equation. 


3.5.3 An algorithm for Bottcher’s equation 


If f(a) is any solution to Bottcher’s equation (3.27), then so is the function 
[f(x)]? for any power q. Bottcher’s equation may be the natural way to formu- 
late a linearization equation when the function a(x) behaves approximately 
as a power function. 

A solution to Béttcher’s equation can be obtained if the limit 


f(z) = lim [a”(x)P ” (3.36) 


n> Co 


exists. This can be checked by plugging into (3.27). 
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3.5.4 Solving commutativity equations 


It is easily seen that the functions f,(x) = a”(x) satisfy the commutativity 
equation 

fla(x)] = a[f(x)] 
for n = 0,1,2,.... However, there may well be many other solutions. One way 
to generate a solution is through a solution to the corresponding Schroder, 
Abel, or Béttcher equation. For example, suppose that g satisfies Schréder’s 


equation: gla(x)] = s g(a). Furthermore, suppose that g is one-to-one with 
inverse g~ +. Then for any constant c the function 
f(x) = 9™"[eg(2)] (3.37) 


must satisfy the commutativity equation. This follows easily using the fact 
that g—! satisfies Poincaré’s equation. We have 


fla(x)] = g~*[eg(o(x))] 


g 
g ‘lesg(2)] 


I 


I 


I 


Ig” *(eg(a))] 
[f(zx)]. 


Similarly, if g satisfies Abel’s equation g[a(x)] = g(x) + a then, for every 
constant c, the function 


f(x) =9""[9(2) + (3.38) 


will satisfy the commutativity equation. 
Finally, if g[a(x)] = [g(a)]? then the function 


F(x) = 9 "{l9(@)}} (3.39) 


will satisfy the commutativity equation for all c. 
It can be seen that these arguments extend. In general, if f is a solution 
to the conjugacy equation 


then solutions to the commutativity equation f[a(x)] = a[f(x)] can be found 
by extending the function (. 


3.6 Generalizations of Abel’s and Schroéder’s equations 
In this section we consider two generalizations of Schréder’s and Abel’s equa- 


tions which fall outside the class of conjugacy equations. 
The equation 
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fla(x)] = f(@) + h(a), (3.40) 
where h(x) is a given function, is a generalization of Abel’s equation. As with 
Abel’s equation, an arbitrary constant can be added to any solution to create 
another solution. It can be checked that for any xo, the function 


f(z) = S3 {hla”(ao)] — hla” (2)]} (3.41) 
n=0 


is a solution to this equation provided that the infinite sum converges. 


Example 3.6. For example, as a solution to the equation 
f(22)=f(x)+2', xz>0 


this algorithm gives 


iy = 2g ES 2-ae) (3.42) 
n=0 
=ec-2a7 (3.43) 
where c = 22% ' is arbitrary. 
The equation 
fla(a)] = g(x) f(x), (3.44) 


where g(x) is a given function, is a generalization of Schréder’s equation. It 
can be checked that for any xo, 


_ _ gla” (xo)] 
ican { gla"(z)] } er) 


n=0 
is a solution to this equation provided that the infinite product converges. 


Example 3.7. As an illustration of this formula, the algorithm solves the equa- 
tion 


f(a/2) =cosa - f(a) (3.46) 
by functions of the form 
_ Tq [cos(2-” x0) 
f(z) = U ea (3.47) 
= dz csc 2a (3.48) 


where the constant d = x5 ! cos zg sin x is arbitrary. Note that to obtain step 
(3.48) above, we need to use the formula 


sin x 


ieee rae (3.49) 
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3.7 General properties of iterative roots 


In this section, we return to Babbage’s problem of finding the iterative roots 
of a function that was discussed in Chapter 1. Here we examine some general 
questions about iterative roots. We also return to Babbage’s original problem, 
and see how the conjugacy equations of Abel and Schréder can be used directly 
to find iterative roots. 

First, we recap the basic definitions. Let h(x) be a given function. Any 
solution f(x) to the equation 


Ff [f(@)| = h(a) (3.50) 
is called an iterative square root of the function h(x). 
Define 
fi(a)=f(z) and f(a) = FL f"(2)| (3.51) 
for n > 1. We shall call a solution to the equation 
f"(x) = h(2) (3.52) 


an iterative n‘” root of h(a). In the particular case where h(x) = x for all z, 
we call equation (3.52) the Babbage equation. When f is a solution to the 
Babbage equation with n = 2, then we call f an involution. 

In general, a solution to (3.52) may not exist, and even when it does exist, 
it usually is not unique. For example, the function f(x) = —z is an involution. 
But so is the function f(a) = a — «x for every real number a. Indeed, involu- 
tions are plentiful, easily constructed, and all satisfy the functional equation 
fl f(x)] = a. In general, any function of the form 


f(x) = 9"'|-g(2)] (3.53) 


will be an involution. 

Fortunately, however, we are often in a position to say something about the 
properties of f(a) even when we cannot determine it completely. For example, 
suppose that h maps the set of real numbers onto itself. That is, suppose that 
the range of the function h is all of R. It follows from this that the range of 
f must also be all of R (i.e., that f itself must be onto). 

Another property that f “inherits” from h is the property of being one- 
to-one. For if h is one-to-one and f(x) = f(y), it follows that f"(x) = f"(y). 
So h(x) = h(y), which implies that « = y. A case of particular importance 
is when f is assumed to be a continuous function. A one-to-one continuous 
function is necessarily either strictly increasing (i.e., f(x) < f(y) whenever 
x < y) or strictly decreasing (i.e., f(x) > f(y) whenever x < y). The proof of 
this fact is left as Problem 5 for the reader. 

Putting these results together, we obtain the following result. 


Proposition 3.8. Let h be a one-to-one function. 
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1. Suppose n is even and that there exists a pair of real numbers x < y such 
that h(x) > h(y). Then there is no continuous iterative n‘” root of h. 

2. Suppose that n is odd, and that f is a continuous iterative n*” root of h. 
Then h is continuous and strictly increasing or strictly decreasing as f is 
strictly increasing or strictly decreasing, respectively. 


Proof. Suppose f"(x) = h(x) for some continuous function f. Then, as indi- 
cated above, the function f is strictly increasing or strictly decreasing. This 
implies that f? is strictly increasing, and therefore that h = f” is strictly 
increasing for all even values of n. However, this contradicts the statement in 
part 1 that x < y and h(x) > h(y). So in part 1, there can be no continuous 
solution f. 

On the other hand, suppose n is odd, and f is monotone. Then, as n —1 is 
an even number, f”~! is strictly increasing (including f°(2) = x). Therefore 


h(a) = f"(x) = ff" (2)] (3.54) 


is strictly increasing or strictly decreasing as f is respectively. 


Using Proposition 3.8, we can now solve the following problem which ap- 
peared as Problem A2 on the 1979 William Lowell Putnam Competition. 


Example 3.9. Establish necessary and sufficient conditions on the constant k 
for the existence of a continuous real-valued function f(x) satisfying 


fl f(x)] = ka? 


for all real x. 


Solution. This can be written in the form f"(#) = h(x) with n = 2 and 
h(x) = kx®. If k < 0, then h(x) = kx® is monotone strictly decreasing. By 
Proposition 3.8, there can be no such continuous function f(x) because n = 2 
is even. So we have k > 0. To prove that this is possible, it suffices to exhibit 
an f(a) for any such k. It is easily seen that 


f(z) = ki/4 73 


works. 


Can we actually construct iterative roots? In many cases yes. For example, 
suppose we can construct a one-to-one solution g(x) to the Abel equation 
g[h(x)] = g(a) + 1 for a given function h(a). Define 


= 1 
fee) =o fala) +2] (3.55) 
Then it can be checked that f(z) is an iterative n‘® root of h(x). Similarly, if 


we can solve the equation g[h(x)| = s g(a) for some s 4 1 with a one-to-one 
function g, then 
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f(a) = 97* |s\/" g(2)| (3.56) 


is an iterative n*® root of h(a). The reader should recognize that what these 
formulas implicitly do is linearize the function h(x) and apply an obvious n‘® 
root to the linearized function. 

Another way to tackle an iterated root equation f”(x) = h(x) is to apply 
the function f to both sides of the equation. We get 


f[h(z)] = fF" @) = FF (@)] = AIF). (3.57) 


This turns an iterated root equation into a commutativity equation. The so- 
lution to the iterated root equation will solve the commutativity equation but 
the converse usually is not true. However, the commutativity equation may 
be helpful in narrowing down the possible solutions, and it is often easy to 
analyze. The following example illustrates the idea. 


Example 3.10. Find all integer-valued functions f(n) taking values in the in- 
tegers satisfying the equation f[f(n)] =n-+ 1. 


Solution. Applying f to both sides, we obtain f(n +1) = f(n) +1. From this 
it is immediate that f can be written in the form f(n) = n+ m for some 
integer m. However, if this is the case, we have 


flf(n)]=n+2m=n41 


which has no solution in integers m. Therefore there is no integer-valued 
function f(n) satisfying the equation. 


Such a method can also be applied to the composition of two or more different 
functions. For example, we could have simultaneous equations of the form 


flg(a)]=hi(a) and g[ f(a)] = ha(a), (3.58) 


where h, and he are given, and f and g are unknown. We apply g to the first 
equation to obtain 


g{hi(x)] = 9(flo())) = halg(x)] (3.59) 
and apply f to the second to obtain 
f[h2(x)] = flglf(2))) = hilf (2) - (3.60) 


The result is a pair of simultaneous conjugacy equations to solve. 

Let us close this section by considering the following problem. It was 
part of the same shortlisted problem at the 1997 International Mathemati- 
cal Olympiad as was given in Example 3.1 earlier. 
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Example 3.11. Do there exist functions f: R — R and g: R > R such that 


fig(@)J=2? and = g[f(z)] = 2° (3.61) 
for all e € R? 
Solution. No. Suppose f and g were solutions. As the function ho(x) = x? is 
a one-to-one function, it follows that f is a one-to-one function. Thus f(1), 
f(—1), and f(0) are three distinct numbers. On the other hand, one of the 
associated conjugacy equations states that [f(x)]? = f(a). Inserting x = 
—1,0,+1 into this equation we get that each of f(—1), f(0), and f(1) equals its 
square. Thus each of these three numbers must equal 0 or 1. By the pigeonhole 
principle, two of f(—1), f(0), and f(1) must be the same. This contradicts 
the fact that f is a one-to-one function. 


3.8 Functional equations and nested radicals 


Let us return to the nested radical proposed by Ramanujan, and described in 
Chapter 1. We wish to compute 


Jrsayieayreay (3.62) 


As previously, we generalize this problem to solving the nested radical 


fla) = fi tayi+ ety (3.63) 


Squaring both sides, we see that f(x) must satisfy the functional equation 


f(a)? =1+af(x+1), (3.64) 


where f(x) > 0. Setting « = 0, we determine that f(0) = 1. Unfortunately, 
we cannot get f(1) by recursion. 

This equation does not fit into any family of equations that we have man- 
aged to solve so far. How can we find a solution, and how can we determine 
that any solution we find is the correct solution to the original nested radical? 
We can try and guess at a solution. Let us look for a polynomial f(a) which 
solves this. (See the section on polynomial equations in the next chapter.) If 
f(x) were a polynomial of degree n then the left-hand side of our functional 
equation would be of degree 2n and the right-hand side of degree n+ 1. So 
our polynomial would have to be of degree one. Let f(x) = ax +b. We get 


(ax+b)?=14+a2(ar+a+b) (3.65) 


or 
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diabiaenls i a 


1729 = 19+ 12°=9°+ 10° 


Fig. 3.2. There are many formulas associated with the legendary Indian mathe- 
matician Srinivasa Ramanujan (1887-1920). Here are two of the simpler ones, the 
first of these being our main interest here. The second formula is associated with one 
of the most famous and touching stories of the collaboration between G. H. Hardy 
and Ramanujan. While in England, Ramanujan became sick with the disease that 
prematurely ended his life. He was admitted to a nursing home in Putney called 
Colinette House. Hardy, visiting him there, and apparently at a loss for substantial 
conversation, remarked that the taxicab number was 1729. “A rather dull number,” 
Hardy added. “No, Hardy,” replied Ramanujan. “It is a very interesting number. It 
is the smallest number expressible as the sum of two cubes in two different ways.” 
Nowadays, the number 1729 is known as the Hardy—Ramanujan number. 
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a’ x* + (2ab)2 +b? =an2?+(at+b)et+l1. (3.66) 


If this is to be true for all x, then the corresponding coefficients on each side 
must agree. We are rather fortunate that the assignment a = 1 and b = 1 
works! 

So f(x) = «+1 is a solution. But is it the correct solution? That is, is the 


equality 
rti=yiteyi+@tDy— (3.67) 
correct? 


We can prove that it is. To do this note that with f(x) defined as in (3.63), 
we have for all x > 1, 


f(x) > \far\/avVa... (3.68) 
= p(t/2)+(1/4)+(1/8) + (3.69) 
=f (3.70) 
> (x+1)/2. (3.71) 
Bounding on the other side, we have 
fla) < e+ 1) (@ +2) V@+3) (3.72) 
< Je +1) (20+) V8@4+) (3.73) 
=(2+1)V1V2v3... (3.74) 
<(e+1)\/1V2Vv4... 3.75 
=2(x+1) 3.76 
Thus se 
- 5 S fle) S$ 2(e+1) (3.77) 
The following result takes us home. 
Proposition 3.12. Let f(x) satisfy the functional equation 
[f(#)? =1+«f(e+1) (3.78) 
and the inequalities 
1 
“S < f(x) <2(e+1) (3.79) 


for alla >1. Then f(x) =a+1. 
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Proof. Replacing « by « + 1 in equation (3.79), we get 


PH? < fet) <2(e +2). (3.80) 
From (3.78) we have 
Eafe Thao SO ep AY (3.81) 


2 
Applying (3.80) to (3.81), we find that 


1+a(a +2) 
2 


Taking the square root of both sides, we have 


< (f(z)? < 2[1+2(@+2)). (3.82) 


2-¥2 (142) < f(x) < 2/7 (142). (3.83) 


Compare (3.83) with (3.79). Our bound has improved on each side! Applying 
this argument k times we get 


a-V/* (14.2) < f(a) <2" (142). (3.84) 


Letting k > co gives us 1+ a < f(x) < 1+ which proves the result. 


3.9 Problems 


1. Find, with proof, all continuous functions f : R — R such that f(22) = 
f(a) for all real x. 


2. Give an example of a continuous function f : R > R such that f(#+2) = 
f(x) +2 for all real x, which is not a monotone function. 


3. * Find, with proof, all continuous functions f : R — R such that f(x) = 
f(cos x) for all real x. 


4. The gamma function I(x), x > 0, is defined by 


nl n® 


ee eae G +2 Gan) 


I'(a) = 


(Compare this formula with those in Section 3.5. You may assume that 
the limit exists.) Find a functional equation involving ['(#) and ['(a +1). 


5. Let f : R — R be a continuous one-to-one function. Prove that f is 
either strictly increasing (i.e., for all x < y we have f(x) < f(y)) or 
strictly decreasing (i.e., for all x < y we have f(x) > f(y)). 
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6. 


10. 


11. 


12. 


3 Functional equations with one variable 


Use the Koenigs algorithm to find a function f other than the trivial zero 
function which satisfies the equation 


5(f2)=s8@), 


where 0 < s < 1. Specify carefully the domain over which f is defined. 


. Find as many functions f as you can which satisfy the commutativity 


equation 


f (<4) 7 oeee 


Relate your solution to the solution to the corresponding Abel equation 


o(-4) =o 41. 


. Complete the proof in the solution to the IMO shortlisted problem of 


Example 3.1. 


. Let f: R- R be a continuous function satisfying the pair of functional 


inequalities 
flee) sf) = fle+ 0). 


a) Let a= V2 and b=1. Prove that f is a constant function. 
b) State general conditions on a and b used in the proof above which 
ensure that f is constant. 


Consider a polynomial f(a) with real coefficients, having the property 


for every polynomial g(x) with real coefficients. Determine the nature of 


f(x). 


Let f(x) be a real-valued function defined for all real x except for + = 0 
and x = 1 and satisfying the functional equation 


f@)+f[@—D/s]=14+2. 


Find all functions f(a) satisfying these conditions. 


x Does there exist a function f : R — R which simultaneously satisfies 

the following three conditions? 

a) There exists a positive number M such that —M < f(x) < M for all 
x. 


13. 


14. 


15. 


16. 


17. 


18. 


3.9 Problems Toe 


b) f(1) =1. 
c) Ifa 0 then 


Find all complex-valued functions f of a complex variable such that 


f(z)+z2f—z)=1+z 


for all z. 


x Suppose that f : R — R has the following properties, 

a) f(a) <1, for all real a; 

b) f(w@+ B) + f(x) =f (w@+%) +f (e+ 3). 

Prove that f is periodic. In other words, prove that there exists a nonzero 
real number t such that for every real number x, we have f(x+t) = f(z). 


x Show that there exists a function f : R — R which is continuous, and 
satisfies the equation 


f(x) + f(2@) + fa) =0 


for all x, but which is not the zero function. 


x Prove that the equation 
f"() =a 


defined on the nonzero real numbers has an infinite number of solutions 
for each positive integer n > 2. 


Show that there is a unique function f : R > R, such that f(z) is strictly 
positive, satisfying for all real x, 

a) f(22) = [f(a)]?; 

b) f(—2) = [f(@))-}, and 

c) f(a) >1l+e2. 

Determine this function. 


* Prove or disprove: there exists a function f(n), defined for all positive 
integers n, taking values in the positive integers, such that 


fFM(n) =n41. 


In this expression, f/((n) denotes that f is applied f(n) times to the 
positive integer n. 
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19. 


20. 


21. 


22. 


23. 


24. 


25. 


3 Functional equations with one variable 
Suppose that f : R — R is continuous and satisfies the equation 
f(x) - flf@)|=1 
for all real x. Assuming that f(1000) = 999, find the value of f(500). 


x Find all continuous functions f : R — R which satisfy the equation 


fet+y) = fla+ fy) 


for all real x and y. 


x For the previous problem, give an example of a function which is not 
continuous, and also satisfies the functional equation 


fle+y) = fla+ fy)]. 


Find all continuous functions f : R — R satisfying 
fle + f(a)] = f(@) 
for all real x. 
x Let u, f, and g be functions, defined for all real numbers x, such that 


= = f(a) 


and 
u(a + 4) + u(x — 4) 


2 
Determine u(x) in terms of f and g. 


= g(x). 


* Find all continuous solutions to the simultaneous equations 


ff(x)] = x 
and f(a) 


where x > 0. 


* Suppose that f: R* > Rt is a strictly decreasing continuous function 
that satisfies the equation 


fle+y) + fF) + fl = Fla + FO) + fly + F@)) 
for all positive x and y. Prove that f[f(x)] = a. 


A 


Miscellaneous methods for functional equations 


4.1 Polynomial equations 


In this chapter, we consider sundry functional equations in which there is 
some specialized character to the problem which requires techniques that are 
not easily exported to other settings. The restrictions that we consider here 
are of various types and are typically much stronger than the assumptions of 
previous chapters. Finally, at the end of the chapter, we consider a functional 
equation whose solution requires some group theory. 

Let us begin by looking at polynomial equations. We assume that the 
functional solution to the equation is known to be a polynomial. Clearly, 
the methods of previous chapters are applicable in this context. However, 
it is worth considering what additional methods are available if we know 
that the solution is a polynomial. In what follows, we usually assume that 
the polynomials have real coefficients, and are functions of real variables. 
However, sometimes we have need to evaluate these polynomials for values 
in the complex plane even when the functional equation is stated for real 
numbers. So we consider polynomials of a real or complex variable with real or 
complex coefficients. First, it may be possible to use the equation to determine 
the degree of the polynomial. Suppose we let deg f denote the degree of a 


polynomial f(x). Let (fg)() = f(x) g(x) and (fo g)(x) = flg(x)]. Then 
deg [f g] = deg f + deg g (4.1) 
deg fog = deg f- deg g. (4.2) 


Using these formulas, any polynomial equation using the operations of multi- 
plication and composition leads to an equation for the degree. 


Example 4.1. Consider an equation such as 


f(e@—1) f@+1) = fF), (4.3) 


where f is a polynomial with real coefficients. Let d = deg f. The left-hand 
side of the equation is of degree 2d, whereas the right-hand side is of degree 
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d?. Thus 2d = d?, implying that d = 0 or d= 2. A polynomial of zero degree 
is constant, and the only constant f(x) = c satisfying this equation is c = 0 
orc= 1. 

The case d = 2 can now be considered. It is left to the reader to verify 
that 


f(a) = (@-1)? 
in this case. 


The next method for solving polynomial equations involves the use of the 
following principle. 


Proposition 4.2. Suppose f(x) is a polynomial which is periodic in the sense 
that there exists some a #0 for which f(x +a) = f(x), for all real x. Then 
f(a) =c for all x. 


The proof is routine. Now let us consider how this can be applied to functional 
equations. Suppose a solution, say fo(x) has been found to a given polynomial 
equation. We must ask whether this is the only solution, or whether there are 
others. We can write a general solution in the form 


f(x) = fo(x) + g(@), 


where g(a) is some polynomial whose form is to be determined. It may be 
possible to show that g(x) satisfies the conditions of Proposition 4.2, and 
thereby to conclude that all solutions of the equation have the form fo() +c. 


Example 4.8. To illustrate this, consider all polynomials satisfying the equa- 
tion 
f(et+1)—f(z-1) =627 +2. 


By inspection, we can observe that. f(a) = x? is a solution. But is it the only 
one? Letting a general solution have the form f(a) = x? + g(x), we plug into 


our equation to obtain 
g(x+1)—-g(x-1) =0 for all x. 


So g(x) is a polynomial satisfying the conditions of Proposition 4.2 with a = 2. 
Therefore f(x) = x3 +c. It is immediately seen that any value of ¢ will work. 


Functional equations can also be used to provide direct information about 
the roots of the polynomial. To illustrate this, here is an example from the 
1995 Romanian Winter Competition. 


Example 4.4. Let us search for all polynomial solutions to the equation 


f(a’) = f(x) f(@—-1) (4.4) 


which are not identically zero. From this equation, we cannot immediately 
infer any restriction on the degree of f(a), as the associated equation for 
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d = deg f is 2d = 2d, which is trivially satisfied. Instead, it is helpful to 
inquire as to the nature of the roots of f in the complex plane C. Suppose 
f(z) = 0. Then it follows from the equation that f(z?) = f(z) f(z -1) =0. 
So z? is also a root of f. So if Z is the set of all roots of f in the complex 
plane, then Z must be closed under the mapping z +> z?. However, if |z| > 1 
or if 0 < |z| < 1, then the sequence 


ae Aa a ae 


will produce distinct points in the plane. Because Z is finite, this is impossible. 
Thus the roots of f can only occur when z = 0 or |z| = 1. Similarly, if z— 1 is 
a zero, then so is 27. Therefore, Z is closed under the mapping z +> (z + 1)?. 
The rest of the problem is left to the reader to solve. See Problem 10 at the 
end of Chapter 1, and the hints to that problem at the end of the book. 


4.2 Power series methods 
Less restrictive than assuming that an unknown function is a polyonomial is 


the assumption that a function has a power series representation. 
Consider the equation 


(A=) =Fr0., (45) 


The reader will recall that this is an example of Schréder’s equation, as dis- 
cussed in Chapter 3, with a(x) = (x + 2?)/2. Let a(x) = (a + x?)/2. If 
0<a <1, then a(x) goes to zero at a rate that is approximately geometric 
with rate 1/2.' Applying the Koenigs algorithm, we obtain 

f(x) = lim 2” a(x) (4.6) 

noo 

as a solution to the equation. This limit is hard to express using well-known 
functions. However, a”(a) is a polynomial of degree 2n. This suggests that we 
should look for a solution to f(x) as a power series 


f(z) =ap9 +a, 2+ag2*+agz°+-:- (4.7) 


for 0 < x < 1, provided this infinite series converges. Plugging in x = 0, in 
(4.5), we see that a9 = 0. As any constant multiple of a solution to (4.5) is 
also a solution to (4.5), we can regard a; as arbitrary and set it equal to one. 
Let us insert our power series back into (4.5). This gives us 


e+ x? a x+a? : x+ x? : 1 a2 9,43 3 
+a free = ee 
D BN eg a ae Q°° ye ee 


See Section 3.5.1 of the previous chapter for the definition of this. 
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Now we expand out the left-hand side, and group terms with corresponding 
powers of x together: 


1 1 a2 2 (2 =) 3 1 a2 9,43 3 
= = as a ae wee SS a aS (4.9 
sat(5 +2) e4(2+5 x” + ptt ety e+ (4.9) 


Next, we need to invoke a very useful property of power series: two power 
series in x are equal for all x if and only if their corresponding coefficients are 
equal. Set b,, = 27~™ am. We obtain 


by = 0 (4.10) 
by =1/2 (4.11) 


together with 


(27-1 —1) bam = Ge 7 ‘) b2m—1+ Ce - i) bom—2++ +++ (") bm (4.12) 
1 2 m 


and 


ae 2m 2m—1 m+1 
(2?"—1) bom41 = ( 1 ) bam ( 9 ) bamaat--+( ) Pmt (4.13) 


for m > 1. From these recursive formulas, the power series for f(x) is deter- 


mined: : ‘ 2 
823 Met 416 
f(t) = e420? +—— + +t, (4.14) 


A plot of this function is shown in Figure 4.1. 


4.3 Equations involving arithmetic functions 


In the previous two sections we have considered functional equations with 
a restriction on the class of functions. In this section, we shall consider a 
completely different kind of restriction which radically changes the nature of 
the solution. 

Consider the following from the Cauchy family of equations, 


f(xy) = f(x) fy), 


where x and y are strictly positive. In Chapter 2, we saw that the positive 
continuous solutions to this equation (if f does not vanish everywhere) are 
power functions of the form f(x) = x* for some real number k. Now suppose 
we restrict the domain of the function f to the positive integers. It is natural 
to make a shift in notation so that 


f(nm) = f(n) f(m), n,m € {1, 2,3,...}. (4.15) 
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0 0.1 0.2 0.3 0.4 0.5 
Xx 


Fig. 4.1. A power series solution to a difficult Schréder equation 


Such a function is said to be completely multiplicative if this equation is 
true for all n and m. Using induction, it is easy to see that any completely 
multiplicative function also satisfies 


f (nang +++ ni) = f(r) f(na) --- f(na) 


for all positive integers 7. 

What are the solutions to this equation when the domain of the function 
f is restricted in this way to the positive integers? It is clear that power 
functions f(n) = n* still satisfy equation (4.15). Are there other functions? 
We can no longer appeal to the same argument which we used in Chapter 
2. Letting n = m = 1, we see that f(1) = f(1)? as before. So f(1) = 0 or 
f(1) =1. As before, if f(1) = 0 then 


f(r) = fr) 
= f(n) f) 
= 0 


for all n. Thus f vanishes everywhere. 
Let us restrict our attention to the case where f(1) = 1. Suppose the 
integer n > 1 has prime factorisation 


— hi pd d; 
N= Py P2 ce ae 
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where p1,...,p; are primes and d;,...d; are positive integers. Applying (4.15) 
and induction, we see that 


f(r) = f(p1)® f(p2)® ++ F(p;)% - (4.16) 


Unlike Cauchy’s multiplicative functional equation, it turns out that this is all 
we can prove, because any function that satisfies this property is completely 
multiplicative. Among all such functions the power functions are simply those 
for which f(p) = p* for every prime p. 

The study of completely multiplicative functions is usually considered to 
be a branch of number theory rather than of functional equations. However, 
this is largely a matter of definition. Several common functions that are used 
in number theory are completely multiplicative. These include the following. 


1. For any prime number p, the Legendre symbol is defined as 


re 1 if n is a square mod p 
(=) a 0 if p divides n (4.17) 
P -1 if n is not a square mod p. 


We can also write 


(=) = ne-V)/2 (mod p). 


Pp 


Weg ol: 


2. The Liouville function is defined to be 


Then 


An) = (-1)2? (4.18) 


where 2(n), known as Big Omega, is the number of prime factors of n, 
with multiplicity counted. That is, 


r (o? py Soecd v'’) — (—1) arta rey . 


Now let us turn to the role of multiplicative functions in mathematics com- 
petitions. Problem 6 on the 1998 International Mathematical Olympiad? held 
in Taiwan was 


Example 4.5. Consider all functions f from the set N of all positive integers 
into itself satisfying 


f (n? f(m)) =m (F()’ , (4.19) 
for all m and n in N. Determine the least possible value of f (1998). 


? On each IMO, problem 6 is traditionally thought of as the “gold medal” problem 
that is designed to separate gold medal winners from the bronze and silver medals. 
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Solution. At first sight this is a functional equation that does not seem to have 
much to do with completely multiplicative functions. Our first task will be to 
elicit the multiplicative property. Let f(1) = a. Setting m = 1, we obtain 


f (an?) = [f(n)]?. (4.20) 


Then setting n = 1, we get 


f(f(m)) = a2 m. (4.21) 
So 
[f(m) f(n)|° = [f(m)? flan?) by (4.20) 


f (m? f (f(an?))) by (4.19) 
= f(m? a? an’) by (4.21) 
f 


=(f(amn))” by (4.20) again. 
Taking the square root of both sides, we conclude that 


flamn) = f(m) f(r). (4.22) 


Setting n = 1 in (4.22), we see that 


f(am) = f(m) FQ) 


Putting (4.23) and (4.22) together we have 


a f(mn) = f(m) f(n) (4.24) 


for all positive integers m, n. 

The next step in the proof is to show that a divides f(n) for every positive 
integer n. Suppose p is any prime number, and suppose that p” and p®* are 
the highest powers of p which divide the numbers a and f(n), respectively. 
Setting m = n in (4.24) we get a f(n?) = [f(n)]?. Continuing in this way by 
induction, we have 


a ey SPO - SRSA Den one (4.25) 


The highest power of p that divides [f(n)|* is p**. Similarly, the highest power 
of p dividing a! is p*—)", So equation (4.25) implies that (kK—1) r < ks for 
every positive integer k. This can only be true if r < s. Now p is an arbitrary 
prime. So a must divide f(n). 

Set g(r) = f(n)/a. Because a divides f(n), the function g takes values 
among the positive integers. In terms of g, equation (4.24) becomes 
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g(mn) = g(m) g(n) (4.26) 


which says that g is a completely multiplicative function. Note also that from 
(4.20) with n = 1, we get f(a) = a’. In terms of g, this becomes 


g(a) =a. (4.27) 


Therefore, 


=n. (4.28) 


Thus g is a completely multiplicative involution on the positive integers. 

Now let us return to our original functional equation given in the statement 
of the problem. In fact, the new function g also must satisfy this equation. To 
see this, we observe that 


We need to prove one final property of g. Suppose p is any prime. Let 
g(p) =tu, where t and wu are positive integers. Then 


g(9(p)) = g(tu) = g(t) g(u). 


Because g(g(p)) = p, it follows that at least one of g(t) and g(u) must be 
equal to one. Suppose, without loss of generality, that g(t) = 1. Then 


t = g(o(t)) = gQ) = 1. 


It follows from this argument that g(p) is a prime for every prime p. 

Recall that we are searching for the smallest possible value of f(1998) 
where f is chosen from the class of functions satisfying this functional equa- 
tion. However, for any such f, we have g(1998) < (1998), because a is a 
positive integer. Therefore it suffices to restrict attention to functions g which 
are completely multiplicative involutions as shown. Because g is completely 
multiplicative, 


g(1998) = g(2- 3° - 37) = g(2) - [9(3)]° - 9(37) 
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where g(2), g(3), and g(37) are prime. If we wish to minimise this, we would 
naturally assign the smallest primes to these values, with the smallest prime, 
namely 2 assigned to g(3). For example, we can set g(2) = 3, g(3) = 2, and 
g(37) = 5. Therefore, the smallest possible value of g(1998) is 2?- 3-5 = 120. 


This beautiful question was a worthy gold medal problem. The solution 
brings together the ideas of number theory, multiplicative functions, involu- 
tions, and algebra. There are many nontrivial steps. However, none of these 
steps requires the use of heavy mathematical tools. 


4.4 An equation using special groups 


We close this survey of functional equations by considering a problem where 
the solution needs some group theory. Consider the following problem, which 
was shortlisted for consideration at the International Mathematical Olympiad 
in 2001 in Washington, DC. 


Example 4.6. Find all functions f : R — R, satisfying 
f(xy) (F(@) — f(y) = (@ — y) Fl) Fy) (4.29) 


for all x and y. 


Solution. Set y = 1 in equation (4.29) to obtain 


[f (2)? = « f(x) FQ). (4.30) 


If f(1) = 0, then equation (4.30) implies that f(a) = 0 for all real x. So we 
restrict attention to the case where f(1) =a #0. 

Now, we could be forgiven for concluding that from here on, the solution 
is obvious. Dividing by f(a) on both sides of equation (4.30), we would seem 
to get f(x) = ax. Because this solution works in the original equation (4.29), 
we might think we are finished. 

However, the situation is not this simple, because we have assumed that 
f(x) £0 when dividing both sides by f(a). So we need to examine this more 
closely. Note first that setting « = 0 in (4.30) gives f(0) = 0. What can we 
say about the set of values x where f(x) 4 0? Let 


G={xreER: f(x) #0}. 
We have already got 
e 0¢G,and1€G. 
Equation (4.30) implies that 
e 2? €GforallzeG. 
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Furthermore equation (4.29) implies that 
e if x and y are elements of G, and «4 y, then ry € G. 


Putting these last two statements together, we conclude that G is closed under 
multiplication. 

Next, suppose f(a) # 0 and f(y) = 0. Once again appealing to equation 
(4.29), we find that f(xy) = 0. So we also conclude that 


e ifee€ Gand y¢G, then ry ¢G. 
A consequence of this is that 


e iff€G,thena2 !eG. 


For if z~! were not in G, then 1 = 2x7! 


contradiction. 
Let us summarise what we have discovered about G. 


would also not be in G, which is a 


1. The set G is a collection of nonzero real numbers that is closed under 
multiplication «y and reciprocation «++ x~! and that contains the mul- 
tiplicative identity 1. 


Now we are in a position to characterize all solutions to equation (4.29). 
Suppose f(x) is any such solution which does not vanish everywhere. Then 
there exists a set G' satisfying the properties listed above and a real number 
a # 0 such that 


Hei" ee (43 


To see that this is true, note first that any such function satisfies the equation. 
In addition the set of values where f does not vanish has the properties given 
above. On this set G we may divide both sides of equation (4.30) by f(z). 
From this we obtain the expression f(a”) = ax, where a = f(1). 


A set G which is closed under multiplication, has the multiplicative identity 1 
and the multiplicative inverse x~! of every x € G is said to be a multiplicative 
group. In this particular case, the group operation is commutative, being the 
usual multiplication that we do on real numbers. Thus G is a commutative 
group or an abelian group. Obviously, the set of nonzero real numbers form such 
a group. However, there are many other choices for G including the rational 
numbers Q. The beauty of this problem is that it illustrates the richness of the 
idea of a group. Although the problem itself has no reference to any special 
multiplicative group of real numbers, the idea of a group is fundamental to 
the solution. 


After considerable discussion, the jury of the 2001 IMO in Washington 
decided not to use this lovely problem as one of the six problems for the con- 
test. Some jury members, this author included, were concerned that students 
might solve the problem and fail to recognise that the solution is complete 
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in the form given. The concern was understandable because our solution is 
rather abstract. Perhaps I, and others on the jury, should have had more faith 
in the insights of some very talented students. We shall never know for sure. 


4.5 Problems 


1. Find all polynomials f(x), g(a), and h(x) such that 


—l for « < —1, 
FORO. Se4o- Seater, 
—2x4+2 forz>0. 


2. In each of the following cases, find all polynomials with real coefficients 
which satisfy the equation. 


a) f(x? +2) = f(x) f(x@+1). 
b) f(g@)) = F(a) g(@). 


3. * Find all polynomials with real coefficients satisfying 


fle—y) + fy—2) + flz—2) =2fletytz) 
for all triples x, y, and z such that ry+yz4+zx4=0. 


4. Let f : NN be a function from the natural numbers to themselves, 
such that f(n +1) > f(n). Suppose that 


f(F(n)) = 3n. 
Evaluate f(2001). 


5. * In each of the following cases, find all functions f : R — R satisfying 
the equation. 
a) f(« f(a) + fy) =F @)" +4, 
b) f (f(x) +9) = f(a? -y) +4 f(a)y. 


6. * Find all functions g : R — R such that there exists a strictly increasing 
or a strictly decreasing function f : R — R such that 


f(at+y) = f(x) g(y) + fly). 


7. x Find all nondecreasing functions f : R — R such that f(0) = 0, f(1) = 
1, and 
f(x) + Fy) = f(a) fly) + fl@t+y— xy) 


for all x and y such that x <1 and y > 1. 


8. x Does there exist a function s : Q > {-1, 1} such that, if x and y 
are distinct rational numbers satisfying «y = 1 or «+ y € {0, 1}, then 


s(x) s(y) = —1? 
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Some closing heuristics 


We have discussed many functional equations in the previous chapters. How- 
ever, the next functional equation you encounter may be completely different 
from any of these and may require radically different methods to solve. In 
that case, some real ingenuity may be necessary. 


Nevertheless, there are a number of rules of thumb that can help you get 


started on a new functional equation. I offer the following suggestions in the 
hope that they will be of use to you. 


Good luck! 


. Get to know the standard functional relationships satisfied by the usual 


suspects: linear functions, trigonometric functions, logarithms, powers, 
and the like. Don’t forget reciprocals and functions of the form 1/(x+ a), 
and so on. 


. Try plugging in a few simple functions such as linear functions or polyno- 


mials and solving for the coefficients. 


. Try solving for a few values such as f(0), f(1), f(—1), and so on. If you 


can find enough of these you may be able to recognize the form of the 
function. 


. If the equation involves two variables, say x and y, try substitutions for y 


in terms or x (or vice versa) which 

e reduce the complexity of the expression, 

e make a term on the left-hand side equal to a term on the right-hand 
side so that the terms cancel. 


. Look for functional equations that are symmetric in x and y, say, on one 


side and asymmetric on the other side. You can make a new functional 
equation by switching the roles of x and y. 


. Try to determine from the functional equation whether the function is 


strictly positive, strictly negative, or has roots for particular values of «. 
This may give insight into the function. Check to see what happens to 
f(x) if x is very large or very small. Check to see whether the functional 
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equation tells you if f(a) is a monotone function. Check to see whether 
the function can be proved to be unbounded. 
Watch out for the opportunity to linearize the equation. 


. If the equation involves f[a(x)], investigate the splinter a” (x). If a(x) = 


x for some m, it may be possible to solve for f(a) by applying the equation 
m times. 


. Watch out for periodic functions, that is, those for which f(a+a) = f(z) 


for some positive a. A statement of periodicity is itself a functional equa- 
tion, and can sometimes be derived from the given functional equation. 
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Appendix: Hamel bases 


It would have been appropriate to mention Georg Hamel and his connection 
with the theory of functional equations in our historical survey of Chapter 1. 
However, the work of Hamel is technical, and best studied after a thorough 
analysis of Cauchy’s contributions such as in Chapter 2. I have seen fit to 
place Hamel’s work in this appendix because his contribution to the subject 
depends upon a highly speculative axiom in mathematics, namely the Axiom 
of Choice, which is not accepted by all mathematicians. 

Georg Hamel was born in 1877 in Dtren, Germany, and died in 1954 in 
Landshut, Germany. In 1897, Hamel went to the University of Berlin, where he 
was taught by Hermann Schwarz and Max Planck, to name two. Subsequently, 
he went to Gottingen University, where he studied with Felix Klein and David 
Hilbert. He was awarded a doctorate under the supervision of Hilbert in 1901. 
The subject of his dissertation was Hilbert’s fourth problem. In 1905, he went 
to Brno. It was during the period of his work in Brno that his 1905 paper on 
Hamel bases was written. In this appendix, we briefly consider the concept 
of the Hamel basis for the real numbers and its connection to the theory of 
functional equations. 

In Chapter 2, we saw that the only continuous solutions to the equation 


f(at+y) = f(x) + fly) 


are functions of the form f(z) = ax for any a € R. It was Georg Hamel 
[1905] who managed to show that there exist many solutions to this functional 
equation which are not continuous, provided that one accepts the Axiom of 
Choice. Hamel’s result may be stated as follows. 


Proposition 6.1. There exists an uncountable set U of real numbers such 
that every real number x has a unige representation 


P= ah, (6.1) 
i=1 
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Fig. 6.1. Georg Hamel (1877-1954) 


where n is a positive integer, q1,---;Qn are rational numbers, and u1,...,Un 
are distinct elements of the set U. 


The representation of each real number x in terms of this basis is unique. 
This may be said to be the crucial point of the proposition. More precisely, 
suppose we can write x in two ways, namely 


n n 
c=) Gi Ui , a=) 15 Uj 5 
i=1 j=l 
where q1,---59n;T1,+++;Tm are rational numbers and uy,...,Un,V1,---;Um © 


U. The ordering of terms is arbitrary. So let us assume that 
Uy < U2 S++ << Un, Vy < Vg <+++ << Um. 


Then the uniqueness of the representation in (6.1) implies that n = m, that 
Gq = 7;, and that u; = vj, for alli =1,2,...,n. 
For the purposes of the work which follows, it is easiest to write the rep- 
resentation of each x as 
rs Ya, 


ucu 


where the sum is over all elements of U, but only finitely many coefficients qy 
are nonzero. 
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The set U is called a Hamel basis for R. Hamel’s proposition asserts that 
the set of real numbers, regarded as a vector space with the scalar field of 
rational numbers, has an uncountable basis. To see how this result can be 
useful in determining additional solutions to Cauchy’s equation, let us consider 
some arbitrary function 

a:uoR. 


We define a function f : R — R as follows. For each « € R we can uniquely 


write 
t= 5 du U, 
ucu 


by Hamel’s representation. Define 


f(z) = So qu ou). 


ucu 


Then the following holds. 


Proposition. The functional f satisfies 


f(at+y) = f(x) + fly) 


for all real x and y. 


Proof. Let 
= Dawe 
ucu 
and 
y= Sore 
ucu 


be the two unique representations of x and y as a linear combination of ele- 
ments from the Hamel basis. Then 


et+y= > (qutru)u 
ucu 


is the unique representation of x + y. Thus 


f(@t+ y) = SG + a) a(u) 


ucu 

= > du a(u) + DS Ty a(u) 
ucu ucu 

= f(z) + fly). 


Note that the second equation holds because all sums are finite, and thus we 
may rearrange terms at will. 
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Because all functions f(«) of this form will satisfy Cauchy’s equation, it is 
natural to check whether our “tame” solutions (e.g., continuous, monotone or 
locally bounded solutions) are of this form. They are, because for any a € R, 
we can choose the function a(u) = au. Then 


F(x) = So qu au) 
ucu 


= So du (au) 


ucu 


“(Ss 
ucu 
= f(a). 


Do we have other solutions? It is not hard to see that we must have other 
solutions. There are clearly other ways to define the function a(u). So we need 
only to show that each distinct definition of a(u) leads to a distinct choice of 
f(a). Suppose we have two functions a;(u) and a@2(u) which both determine 
the same function f(x). Then for any x = >> q, u, we must have 


S- Quo (u) = du Q2(u) 2 


ucu 


In particular, we set « = uo, say for any up € U. Then from the uniqueness 
of the Hamel basis representation, we see that q, = 1 and q, = 0 for u = ug 
or u # Uo, respectively. So 


ai(uo) = >> quaa(u) 


ucu 


= f (uo) 


= So qua2(u) 


ucu 
= a2 (uo) é 


So just as there are many other functions a(u), there are many other solutions 
to Cauchy’s functional equations other than the tame ones f(x) = ax. 

What do these solutions look like? As we have seen from Chapter 2, any 
solution to Cauchy’s equation which is bounded below (or bounded above) 
on any interval of positive length must be tame. So we must conclude that 
any “wild” solution to the equation is unbounded on every such interval. 
Therefore, it cannot be continuous at any point. 

In attempting to visualise such a function, we find that the situation is 
much worse even than this. It requires the Axiom of Choice, a notoriously 
nonconstructive existence axiom, to demonstrate that such functions exist. 
This means that we have little hope of being able to construct an algorithm 
that would let a computer calculate such a function. 
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Hints and partial solutions to problems 


7.1 A warning to the reader 


It is always tempting after working on a problem for a while to turn to the 
answers at the back. However, the reader is warned not to turn to quickly to 
this section if a problem is a hard nut to crack. Often, a problem which seems 
to be almost impossible on a first run through will yield its secrets after you 
put it aside for a little while. 

Although this chapter is said to contain hints and partial solutions, the 
reader is also warned that some of the partial solutions are just short of being 
complete proofs. So if you only want a hint, do not read too much of what 
follows. 


7.2 Hints for Chapter 1 


1. Note that there are multiple solutions: if f(a) and g(y) are solutions, then 
so are f(z) +a and g(y) — a. Write 


log(l+a+ay+y) = log(1+2) + log(1+y). 
Then set z = 0 to determine g(y) and y = 0 to determine f(z). 


2. Suppose that z and w are any two positive numbers with z? > 4w. Then 
the simultaneous equations 


c+y=z 
Ly =w 
have solutions in positive x and y. Thus f(z) = f(w). To remove the 


restriction that z? > 4w, prove by induction that f(a +n) = f(z) for all 
positive x and for all positive integers n. 
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. First note that f must be a one-to-one function. For if f(n) 
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. Setting x and y to zero, we observe that f(0)? = f(0)4. So f(0) = 0 or 


f(0) = 1. Show that if the former, then f(a) = 0 for all x. The conclusion 
will then follow. Show that if the latter case is true, then f(x) = f(—<). 


. Set a = x and show that f(x) = 6+ 2x”, where 6 is arbitrary. 


a) Try f(a) = 0b”. 

b) Compare this equation with d’Alembert’s equation. 

c) The function f(a) = 0 works. So does f(a) = 2~+. Can you extend 
this solution to a more general family? 

d) Once again, f(x) = 0 and works. Another solution is f(a) = 1. For less 
trivial examples, try a polynomial. What happens when f(x) = x?? 
Note that if f(x) is a polynomial then you can figure out its degree 
quickly by equating the degrees of each side of the equation. 


. Setting x = y, we see that 


g(22) + g(0) = 2[9(x)]’. 


Now apply limits to each side to obtain g(0) = 0. Plug in the values y = 0 
in the original equation and use the fact that g(0) = 0. 


a) Plug in « = 0 and y = 0 and solve. 

b) Square both sides of equations (1.45) and (1.46), and then add. 

c) It is necessary to show that the only bounded solution to Cauchy’s 
exponential equation such that h(0) > 0 is the function h(x) = 1 for 
all x. Use h(0) = h(x) h(—x) to show that h(a) > 0 for all a. 

Now suppose that there is some 29 such that h(aq) 4 1. Use this fact 
to prove a contradiction by showing that h(a) is not bounded. 

d) Differentiate Cauchy’s exponential equation to y, and set y = 0 to 
derive the equation h’(x) = h(a) h’(0). In addition, prove that h’(0) = 
0. The result will follow. 


= f(m) it will 
follow by (a) that n = m. So we can write (a) as f(n) = f~'(n). From 
this, we get in (b) 
f(n+2)+2= f(n) 
= f(n). 


Equivalently, we have f(n+ 2) = f(n) — 2. The result now follows easily 
from an argument by induction. 
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9. Suppose we can show that f is a strictly increasing function. That is, 
suppose that f(n) < f(m) if and only if n < m. Then in particular, we 
would have n+1 > f(n) for all n using the property of the function given 
in the question. However, a strictly increasing function from the positive 
integers into the positive integers has the property that f(n) > n. It would 
follow that n+1> f(n) > n, and therefore that f(n) =n, for all n. 

To demonstrate that f is strictly increasing, first note that f(1) < f(n) 
for all n > 1. To prove this, we need to recognize that f(1) is the only 
value in the range of f for which we cannot construct a smaller value: 
f(n) = f[{(m-1) +1] > fl f(n— 1] for alln > 1. 

Next, define the function fi(n) = f(n+1)-—1 for n = 1, 2,.... Once 
again, prove that fi(1) < fi(n) for all n > 1. Continue this argument. 


10. a) The only constant solutions are g(7) = 0 and q(x) = 1 for all x. So 
we restrict consideration to the case where q(x) is nonconstant. Note 
that for « = 0 and x = —1 the equation becomes 


q(0) = 4(0) (1), q(1) = q(-1) q(0). 


Thus 0 is a root if and only if 1 is a root. Investigating functions of 
the form q(x) = x* (x — 1)* we observe that these are solutions. Are 
there any more? 
Let us write 

g(x) = aa! (x —1)* r(z) 
where neither 0 nor 1 is a root of r(x), and the leading coefficient of 
r(a) is 1. Plugging into the equation, we see that 


ax (x —1)* (a +1)¥ r(x?) = a? ao) * (29 +.:1)9 (w@ —1)¥ r(x) r(x +1). 


So a = 1. Because x cannot divide any of r(x”), r(x), and r(x+1), we 
conclude that 7 = k. Cancelling the appropriate factors on each side 
we see that r(x?) = r(x) r(a +1) where 0 and 1 are not roots of r(z). 
We show that r(a) = 1. 
Suppose not. So r(x) has a complex root, say a 4 0. Because r(a) = 0, 
we have 

r(a”) = r(a)r(a+1) =0. 


So a? is also a complex root of r(x). Continuing in this way, we con- 
clude that a, a?, a*, and so on are all roots of r(x). But a polynomial 
can only have finitely many roots. If |a| 4 1 then this sequence of 
roots would be distinct, which is impossible. So |a| = 1. 

Similarly, if a is any such root, then 


r((a- iF) =r(a—1)r(a) =0. 


So (a — 1)? is also a root. It follows that |a — 1] = 1. There are two 
points in the complex plane satisfying |a| = 1 and |a — 1| = 1. These 
are 
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a=5+> 7. 
However, we observed earlier that if a is a root then so is a?. But this 
property is not satisfied by the only two possibilities for roots listed 
above. 
In this case, the situation is very similar to the previous case up to 
a point. Note first that p(1?) = p(1) p(0). So p(1) = 0 or p(0) = 1. 
Suppose that p(1) = 0. Then p(2?) = p(2) p(1) = 0. Continuing in this 
way, we deduce that p(n) = 0 for infinitely many positive integers n. 
As a nonconstant polynomial, p(a) can have only finitely many roots. 
Thus we have a contradiction, and can conclude that p(0) = 1. 
Suppose p(a) = 0 for some complex number a. Then p(a?) = p(a*) = 
--+ = 0. However, a polynomial can only have a finite number of roots. 
So a must be a root of unity. Therefore, |a| = 1. Similarly, the reader 
can prove that |a + 1] = 1. Use this to show that a? +a+1=0, and 
therefore that 

Aas: 


oy 
we 


2 2 


Now there is an important difference between this solution and that 
we found above. In this case, we have two cube roots of unity, whereas 
the two values found above were not. 

Now the polynomial p(x) has real coefficients. Therefore, if 7 —aisa 
factor of p(a) then so is x—@. So we can conclude that («—a) (a—@) = 
x*+a+1 is a factor. Indeed, any nonconstant factor of p(a) with real 
coefficients must be of the form (a? + # +1)”. 

It follows that p(x) can be written as p(x) = c(x? + 2+ 1)", where 
c is some real number. However p(0) = 1. Therefore c = 1. It can be 
checked that n can be any positive integer. 


Make the substitution g(x) = f(«) — 4, and write the functional equation 

in terms of g(x). 

a) Show that g(x + 2a) = g(z). 

b) Make the substitution h(x) = 4 [g(x)]? — 5, and show that h(x +a) = 
—h(a). Show that h(a) can be defined arbitrarily on the interval [0, a) 
subject to the restriction that |h(x)| < }, and then can be extended 
to all real x. 


Define 

g(x) = f(a) — « fC) + « f(0) — FO) 
and show that g(x) has the same property as f(a). It is sufficient to show 
that g(x) = 0 for all x € {0, 1]. 
Now g(0) = g(1) =0. Let M and m be the maximum and minimum values 
of g on the interval (0, 1]. By the extreme value theorem, the sets 


13. 


14. 
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{x € [0,1] : g(x) = M} 
and 

{x € [0,1]: g(x) =m} 
are both nonempty. As f is a continuous function, both of these sets must 
be closed, and therefore have maximum and minimum values. Let 

y = max{z € [0,1]: g(x) = MY}. 

If 0 < y <1, there would exist an h > 0 for which 


uw - 99D tay +h) | 
2 

As M is the maximum of g, this would imply that g(y+h) = g(y—h) = M, 

contradicting the fact that y is the maximum value at which g attains M. 

So y must equal 0 or 1. From this, it follows that g(a) < OforallO<a#<1. 

Similarly, it can be shown that g(x) > 0. 


Let c = g(0). Then f(a + c) = 2% + 5. So f(z) = 2(z — c) +5. Solving for 
g(y) in the original equation, we get g(y) = y/2+c. Therefore, g[z+ f(y)] = 
(a + 2y + 5)/2. 


Define 

S={ax: f(r) =a}. 
Use the continuity of f to show that S is a closed set. From condition (b), 
we know that S is not empty. 
Let us suppose that S #4 R. We prove a contradiction. As S' is closed, 
we can let zo be a boundary point of S. Suppose that y ¢ S. Then there 
exists some real number r 4 0 such that f(y) = y +r. Use property (a) 
and the fact that f is one-to-one to prove by induction that 


fytnr)=(ytnr) +r. 


Now suppose x ¢ S. Let « = |f(x) — a|. As f is continuous, there exists 
a0 <6 < e€/4 such that |z — z| < 6 implies that |f(z) — f(x)| < €/4. 
Additionally, the continuity of f implies that there exists an 7 > 0 such 
that 7 < 6, for which |w — x| < 7 implies that |f(w) — f(ao)| < 6. 

In particular, let us choose the real number y such that |y — 2o| < 1, 
and such that f(y) 4 y. (Don’t forget that xo is a boundary point of S. 
Therefore, we can get arbitrarily close to x9 with points that are not in 
S.) Then 


0<|f(y) — yl SIFY) — Flxo)| + |f(20) — yl 
= |f(y) — f(xo)| + |xo — y| 
<d+7 


< 206. 
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Now let r = f(y) — y. Because 0 < |r| < 26, there exists an integer n such 
that |(y+nr)—2| <6. But f(yt+nr) = (y+nr)+r. Therefore 


e=[f(2) - al 
< |f(2) — Fy + nr) + [FG + nr) = al 
<i l(ytnr)+r—2| 
< Ft l(y+nr)— ol + Ir 
€ 
<7 t6+26 
eee 
4 4 2 
=e. 


This is a contradiction, which we obtained by assuming that S # R. 


7.3 Hints for Chapter 2 


1. Just follow the proof of Theorem 2.3. You do not need to worry about 
negative values this time. 


2. The steps given in the problem are big hints in themselves. 


3. First, suppose that g(0) = 0. Using the equation f(y) = g(0) h(y) it will 
follow that f(y) vanishes for all y. However, the equation g(x) h(y) = 0 
for all x and y implies that either g vanishes for all real numbers or h 
does. Similarly, if h(0) = 0, the same conclusion follows. 

Suppose then, that g(0) and h(0) are both nonzero. Set g(0) = a and 
h(0) = b. Prove that g(x) = b-1 f(x) and h(x) = a! f(x). Then prove 


that 
f(x) 


fo(x) = eas 


is continuous, and satisfies Cauchy’s exponential equation. 


4. Let f(0) = b, and define fo(x) = f(x) — b. We need to show that fo 
satisfies Cauchy’s equation. First of all, set y; = 0 for all i, and set x, = 
0, 2 = x, and x3 = —2, to show that fo(—x) = —fo(x). Secondly, 
set y; = 0 for all ¢ and write 73 = —(a1 + x2) to obtain the equation 
fo(a1) + fo(v2) + fo(—a1 — x2) = 0. Then apply the first result. 


5. The proof of this result carries through in much the same way as the 
original proof of Theorem 2.3, with the usual addition replaced by addition 
modulo one. Because f(0 + 0) = 2 f(0) (mod 1) it follows that f(0) = 0 


7.3 Hints for Chapter 2 103 


(mod 1). From this we also get f(1) = 0 (mod 1). From this fact, we can 


show that 
1 
k 
a1 (2) =0 


mod 1, which is equivalent to 


1 
(se) = 5 


mod 1, again. It must be established that the value of n is independent 
of the choice of & in this formula. From that point on the proof is similar. 
We extend this to show that 


i(B)=<3E we 


and use the fact that the dyadic rationals are dense. 

6. Replace addition by addition modulo one in both the domain and the 
range of f. By combining the methods of Problems 4 and 5, it can be 
shown that f(z) =na+b (mod 1). 

7. Consider the equation 

f{(tu) x, (tu) y] = ft (ua), t (uy). 


Apply the modified Euler equation (2.66) once to the left-hand side and 
twice to the right-hand side to show that the function h satisfies the 
equation h(tu) = h(t) h(u). 


8. Suppose b > a. Define « = (b— a)/n, where n is a positive integer. Then 
define x9 = a, and 7; =a+7e, fori=1,...,n. Then 


If(a) — FIs y |f(ws) — f(@i-1)| 


< Son? K (b-a)? 
i=l 
=n 'K(b-a)’. 
Letting n — 00, we have n7! K (b— a)? > 0. 


9. No. The function f(x) = |z| is subadditive. (This is the triangle inequal- 
ity.) In addition |x| > x. However, |x| 4 x when «x is negative. 
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10. Note first that if f(#) = 0 for any rational 2, then f must vanish on 
all rational values. Henceforth we assume that f(#) # 0 for all x. Let 
f(0) =a. Prove by induction that 


Therefore we have 
F(@+1) = flz+ f(-a)] = f(x) f(-a@) = f(a). 


It follows that f(m) = a for all integers m. Now suppose that m/n is in 
the range of f for some integers m and n. So there exists an x such that 
f(x) = m/n. We can write 


a= f(m) 
ree | 
= f[n f(x) 
m n 
=0(2)" 
As a £0 we can cancel to obtain m/n = +1. 


Can m/n = —1? Suppose that —1 is in the range of f. Then f(a — 1) = 


f(x)(— 1). But this contradicts the statement that f(a +1) = f(a) for 
all x. 


11. The steps given in the question are hints enough. 


12. The zero function is one solution to this equation. Suppose that there 
exists a real number yo such that f(yo) 4 0. Then 


HTGA) =F (= - i) =F Hey 


This implies that f(x) = f(|2]) for all real x. If we define g(x) = f(./2) for 
x > 0, then g satisfies Cauchy’s exponential equation g(a + y) = g(x) g(y) 
for x, y > 0. 

Now if g(0) = 0 it can be seen that g(a) = 0 for all « > 0. Suppose 
that g(0) # 0. Then the proof that g(x) > 0 for all x > 0 is similar but 
not identical to the proof for Cauchy’s exponential equation because g is 
restricted in its domain of definition to nonnegative x. Note that if g(x) = 
0, then g(a/2) = 0. So if there were an x > 0 for which g(x) = 0, then 
it would be possible to construct a sequence x, «/2, x/4, ... converging 


13. 


14. 
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to zero such that g vanishes on all these values. However, g is continuous. 
So this would imply that g(0) = 0, a possibility that we have already 
eliminated. It follows that g(a) 4 0 for all nonnegative x. Using g(x) = 
[9(x/2)]?, we observe that g(a) > 0. The rest of the argument parallels the 
original argument for Cauchy’s exponential equation. We conclude that 


f(x) = [FDI. 


Note that f(x) = 0 for all x is a solution to the equation. Let y = 0. Then 
[f(x]? = [f(x)]}? [f(0)]?. If f(x) 4 0 for some value of x, then f(0) = +1. 
But f satisfies the equation if and only if —f does. So we can reduce to the 
case where f(0) = 1. Putting « = 0, we can conclude that f(y) = f(—y) 
for all y. Putting x = y, we can conclude that f(2x) = [f(z)]*. 

From this equation, we see that if f(x) = 0 for any x, then f(#/2) = 0 
as well. Using an argument similar to the previous problem, we see that 
f(x) #0 for all x. Because f(x) = [f(a/2)]* it follows that f(x) > 0 for 
all x. 

By induction, it can be established that 


2 


f(na) =[f(a)]” . 
Letting « = 1/n we get f(1) = [f(/n)}”, and thereby 


f/m) = [Fay 
So 


2 


f (=) = fim a/n)] = Faye = rae’. 


Apply continuity to show that f(x) = Lr]. Again, if f(a) is a solution, 
then so is — f(z). 


a) Write u = f(x) and v= f(y). The equation becomes 


(@f-") (S) _ (of )(e) + (9f (0) 


2 2 


which is Jensen’s equation. The result follows by solving Jensen’s equa- 
tion. 
b) Let g(x) = f(a +t). Then the equation can be rewritten as 
go a eu fo - 5) 


2 2 
Applying the result of part (a), we see that 
f(a +t) = a(t) f(x) + b(t) 


for all real x. This is an example of Vincze’s equation. So we must 
have 
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f(z@)=cxr+d or f(x) =cs* +d. 


The first of these gives rise to the usual arithmetic mean. The second 
mean is 


- = + “| 
Ss 9 
which is called the exponential mean. 
c) Use logarithms to transform this property into the translative property 
of part (b). The two homogeneous means are the geometric mean and 
the root mean power 


xP + yP ue 
5 


15. Let x = y = 0 in the equation to obtain f(0) = 0. Next, we differentiate 
both sides of the equation, once with respect to x and once with respect 
to y. The resulting equation is 


O= f"(@+y) f@—y)- flet+y) fey). 
Let « = (u+v)/2 and y = (u— v)/2. We obtain 


f'(u) f(r) = Fw) f°). 


The zero function solves this equation. Looking for nontrivial solutions, 
we suppose there exists a value vg such that f(vg) 4 0. Then the equation 
becomes f’(u) = cf(u), where c = f”(vo)/f(vo). This is a standard 
differential equation with the initial condition f(0) = 0. If c > 0, we have 
f(u) = A sinh(/cu). If c = 0, we have f(u) = Au. Finally, if c < 0, we 
have f(u) = A sin(/—cu). 


16. Step 1. Prove that f;(0) =0, using rule (b). 

Step 2. Prove that f)(x) = x, using rule (a) and Step 1. 

Step 3. Prove that fn(c, 0,...,0, —c) = 0. To show this, use the symme- 
try property of f,, to switch c and —c, and use rule (b). 

Step 4. Prove the required result using induction. From Step 2 it is clear 
that the result is true for n = 1. Suppose it is true for n = k. Rule (c) 
says that fp41(@1, ---, +1) depends only on xg41 and 71 +---+a%. Let 
a, +-+++ax¢41 = 0. Then using the observation above, we get 


Fe+i(@i,--+, @k+1) = froi(—ae41,0, -.., 0, ae41) 
=0. 
Finally, let 21,...,@41 be arbitrary with (a1 + ---+a%g41)/(K +1) =u. 
Then 
Fe+i(@1, --+, Capi) = f(@i —U,..-, Choi —U) + 
=O0O+u 


=U. 


7.4 Hints for Chapter 3 107 


7.4 Hints for Chapter 3 


1. For any x, the sequence 27" x, n = 1, 2, ..., converges to zero. Conclude 
from the functional equation that f(x) = f(2~” x) and use the continuity 
of f to show that f(x) = f(0) for all «. Hence f is a constant function. 


2. The function f(x) = «+ sin(a 2x) will work. 


3. First show that a(a#) = cosz is a contractive mapping. Use this to show 
that a”(x) = cos” converges to the unique solution to the equation 
COS Zp = Xo. As Xo is in the closure of the orbit of every x, it will follow 
from the continuity of f that f(x) = f(xo). So f must be a constant 
function. 


4. Prove that '(a+1)=aI (2). 


5. Suppose that f is neither strictly increasing nor strictly decreasing. We 
prove a contradiction. Under this assumption, it must be possible to find 
a set of real numbers x < y < z such that either max[f(z), f(z)] < f(y) 
or else min[ f(x), f(z)] > f(y). It is sufficient to examine the first of these 
cases as the second is similar. We can also assume that f(z) > f(x) as the 
other case, namely f(z) < f(x) follows by reversing the roles of x and z. 
Because f(z) lies in the open interval (f(x), f(y)), it follows by the inter- 
mediate value theorem that there exists a real number w between x and 
y such that f(w) = f(z). However, z is greater than x and y. So w # z. 
This contradicts the fact that f is a one-to-one function. 


6. This is an example of Schréder’s equation with a(#) = sa/(1+ x). The 
Koenigs algorithm gives f(a) = x/(1+a —s) as a nontrivial solution. The 
equation a(x) = x has solutions x = 0 and x = s — 1. We may define f 
for all x such that « # s—1. 

7. The function g(x) = x71 satisfies Abel’s equation gla(x)] = g(x) +1, 
where a(x) = «/(x+1). So, using equation (3.38), we deduce that f(x) = 
x/(ca +1) is a solution to the given commutativity equation for all c. 


8. We start with the functions f and g defined on [1, 00). On the interval 
(0, 1), define f and g to be 1/f(1/x) and 1/g(1/z), respectively. Next, set 
f(0) = 0 and g(0) = 0. To extend f and g to negative numbers, we define 
f(—«) = f(a) and g(—2x) = g(x) on (—ov, 0). Defined on the entire real 
line in this way, the functions f and g can be seen to satisfy the required 
equations. 
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9. 


10. 


11. 


12. 
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The constancy of f can be proved for all a and b such that b/a is irrational 
and positive. Take any real values x and wo, and suppose that f(x) > 
f(vo). Then there exists a 6 > 0 such that f(x) > f(t) for all t such 
that |t — xo| < 6. Next, the reader should prove that there exist positive 
integers n and m such that 


lc -—ay —ma+nb| <6. 


So f(x) > f(a —ma-+nb). However, from our functional inequality, we 
have 
fle —ma+nbd) > fe+nd) > f(a) 


which is a contradiction. Similarly, f(a) < f(a) also leads to a contra- 
diction. We conclude that f(x) = f(xo) for all x. That is, f is a constant 
function. 

The reader should note that we have not used the continuity of f at any 
place other than xp. Thus it is sufficient to assume that there exists a 
point x9 at which f is continuous. 


This problem is a bit of a mathematical joke, and is much easier than it 
looks (in retrospect). Let g(a) = a for all x. Then our equation becomes 
f(a) =a. But this is true for all real numbers a. So f(x) = x for all a. 


Let a(x) = (a — 1)/ax. To get some insight into the nature of this func- 
tional equation, we examine the splinter of a. We get a?(a) = —1/(a—1) 
and a3(x) = x. The fact that the splinter of x is finite suggests that the 
functional equation can be solved by simple substitution methods. Re- 
placing x by a(z) in the functional equation, we get f[la(ax)] + f[a?(x)] = 
2—1/z. Replacing x by a?(x) in the original functional equation gives us 
fla? (x)|+ f(x) = (w—2)/(a—1). Adding this last equation to our original 
functional equation and subtracting the intermediate equation, we get 


a — 27-1 
F(z) = 2Q2a(a—1) 


Such a function does not exist. For suppose f satisfies the conditions of the 
problem. Let c be the smallest integer multiple of 1/4 such that f(x) <c 
for all x. Use (c) to show that f(2) = 2 and conclude that c > 2. Also, by 
definition, there exists some real number x such that f(x) > c— 1/4. So 


cer(era)-torp (fect bOl 


Therefore f(1/x) > —1/2. Additionally, 


c2e(l +s") = (2) +12 f+ (c-I) . 


13. 


14. 


15. 
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This last inequality can be checked to be incompatible with the inequality 
c>2. 


As in problem 11, above, we see that if we let a(z) = 1 — z, then the 
splinter a"(z) is a finite set because a?(z) = z. Replace z by 1 — z in 
the functional equation to get f(1 — z) + (1 — z) f(z) = 2-— z. Eliminate 
f(1— z) from the two functional equations to obtain 


(l=z2+27) f(z) S 1-242’. 


It is tempting to leap to the conclusion that f(z) = 1 for all z. However, 
the exceptional values are those values of z for which 1 — z+ z? = 0 as 
we have not assumed that f is continuous. Thus f(z) = 1 for all z except 
possibly z) = e7*/3 and z, = e~7*/3, We have z + 2, = 1 and zz, = 1. 
Write f(zo) = 1+ wo and f(z1) = 1+ wy. Plug these expressions back 
into the original functional equation to obtain wo + zo wi = 0. Thus wy is 
arbitrary, and w , can be defined in terms of wo. 


Let x be any real number. Define a, = f(a + k/6 + 1/7). Then 


Ak 141 + Ok41,1 = Ok + Ok41,141- 


For any positive integer m, we can sum these equations over values of k, 
where 0< k < m-—1, to obtain 


G0 14+1 + Amt = Got + Omi41 - 


Adding these equations over 0 < 1 < n—1 for each positive integer n 
yields 
don + Amo = 40,0 + Amn - 


In particular, we can set m = 6 and n =7 to obtain 2 f(a +1) = f(a) + 
f(x +2). So the sequence f(x+ n) is an arithmetic progression. However, 
by assumption (a), this sequence is bounded. So f(a+1) = f(a) for all a. 


A power function of the form f(#) = x? almost works, because the func- 
tional equation can be reduced to 1” + 2? + 3? = 0. If a is negative, how- 
ever, the expression x? is ambiguous or multi-valued. This problem can 
be eliminated by considering a function of the form f(#) = |a|?. Unfortu- 
nately, there is no real number p that satisfies the equation 1?+2?+3? = 0. 
But if we could find a complex number p satisfying this equation, then 
we could define f(x) = #(|z|?), where # denotes the function which gives 
the real part of any complex number. To ensure that f is continuous, we 
need to have R(p) > 0. 

Let p =a+b\-—l. Then to satisfy the equation 1” + 2? + 3? = 0 we need 
to have 

2° sin(b log 2) + 3° sin(b log 3) = 0 
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and 
2° cos(b log 2) + 3° cos(b log 3) = —1. 


When a = 0, the first equation is satisfied by b = 1/log(3/2). However, in 
this case, the left-hand side of the second equation is greater than —1. On 
the other hand, as a approaches infinity, then the first equation is solved 
by a value of b which approaches the solution to sin(b log 3) = 0. A value 
of b which solves this is b = 37/ log 3. For this choice of the left-hand side 
of the second equation goes to —oo which is less than —1. To complete 
the proof, the reader will need to argue carefully that somewhere between 
these extremes lies a solution to both equations. 

The reader with a more practical bent may find such an existence argu- 
ment a bit unsatisfying. Those of such a persuasion may be satisfied to 
know that 

p © .9405921833 + 8.230709548 V/—1 


will work. A plot of the function f(x) = R(\z|?) can be found in Figure 
Tal. 
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Fig. 7.1. A function satisfying the required functional equation 


Suppose we could partition the nonzero real numbers into n sets A;, Ag, ... 


such that 
a) there exists a function g; : A; > Aj41 which is one-to-one and onto 
Ajs1 for alli =1,...,n—1; 


» An 


17. 


18. 


19. 


20. 
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b) if x € A; then ete Aj. 

We could then define f(x) = g;(x) for x € A;, and f[gn—19n-2°°- 91(X)] = 
x’ for all x € Ay. As every element y € A, can be represented uniquely 
a Jn—19n—2°°: gi(x) for x € Aj, it follows that f is well defined on the 
nonzero real numbers, and that f"(x) = a7!. 


The function f(x) = e” satisfies these conditions. We need to prove that 
it is the only one. We can prove using induction and condition (a) that 
f(x) = [f(2-"2)]?". So (1+ 2-"x)?”" < f(x) by condition (c). Similarly, 
we have f(—x) > (1 — 27"2)?". Therefore, 


(40s < fey =f <0 eae 1 Sa ee)- 


Taking limits as n — oo gives us e® < f(a) < e*, which gives us the 
required result. 


The function does not exist. Consider a sequence defined by a; = 1, a2 = 
f(1), and an = f(an—-1) for all n > 2. The functional equation can be used 
to prove that the sequence {a,,} contains all positive integers. To see this, 
note first that the number 1 is obviously in the sequence. If a, = x, then 


ant yta) = FAO) (m) = f(a) = 2 +1. 


So by induction the sequence must hit every positive integer. Also all 
members of the sequence are distinct. (If a, = dm where n 4 m then the 
sequence cycles and therefore the range of the sequence is finite. But this 
would contradict the result just proved, namely that the sequence contains 
all positive integers.) Finally, {a,} is an increasing sequence: If x = ax, 
then 2 + 1 = az4sf(z). So «+ 1 appears later in the sequence than x for 
all x. So the sequence must be a, = n for all n. Therefore f(n) =n +1, 
which does not work. 


We have f[f(1000)] = 1/999. So the range of f contains 999 and 1/999. 
By the intermediate value theorem, there exists an a such that f(a) = 500. 


So f(500) = f[f(a)] = 1/500. 


The function f must be constant or be the identity function. Letting x = 0 
we see that f(y) = f[f(y)]. So f(a) = x on the range of f. Let a and b be 
the minimum and maximum values, respectively, of f. (The minimum or 
maximum could be infinite.) As f is continuous, the intermediate value 
theorem tells us that the interval (a,b) lies in the range of f. So f is the 
identity on (a, 6). 

If a = 6 then f is constant. If a = —oo and b = +00, then f(x) = a. 
Suppose that a is finite and b > a. The continuity of f implies that 
f(a) = lim, -4a+ f(x) =a. So in this case f must be the identity on [a, b). 
So f(z) = x on some interval [a,a+ 2d] for d > 0. Let 0 <t < d. Then 
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f(a—t) =a+s for some s > d. (If s < dthena = f(t+a-t) = 
f(t+a+s) =a+s-+t, which is a contradiction.) However, the statements 
f(a—t) > a+d for all ¢ in (0, a) and f(a) = a contradict the intermediate 
value theorem. 

The case where 6 is finite and a < b is similar. 


Try f(x) =a — [a]. 


The function f must be constant. Suppose not. Then there exist real 
numbers 2 and y such that y lies between x and « + f(x) and such that 
f(x) 4 f(y). Then there exists a straight line L defined by the equation 
xz+ny = c, where n is some positive integer, that separates the point 
(y, f(y)) from the points (a, f(a)) and (a + f(x), f(x)) on the graph of 
the function f. By the intermediate value theorem, there exist two distinct 
points A and B with x-coordinates a and b respectively that lie on the 
graph of the function and on the line L. By induction we can establish 
that f(x) = fla+nf(x)]. So f(c) = fla+nf(a)] = f(a). Similarly, 
f(c) = flb+ nf(b)] = f(b). So f(a) = f(b). Thus A and B have the 
same y-coordinate. But this means that a+n f(a) = b+n f(b) expressed 
in terms of L. So a = b which is a contradiction, because A and B are 
distinct. 


In fact, there are infinitely many expressions for u(x) in terms of f and 
g. Here are three: 


u(x) = g(x) — f(x +3) + fla+1)+ f(x—1)— f(x—3) 
—g(x+2)4+ f(x +5) — f(w@+3)+ f(a@+1)4+ f(x—-1) 
= g(a +4) — f(a +7) + f(a +5) — f(a+3)+ f(x 1). 


Can you find a way of generating all such formulas? 


The unique solution is f(x) = 1/z, a result which is known as Dubikajtis’ 
theorem. To prove this, consider a set Q which is defined as the smallest 
set of real numbers such that 

a) LEQ; 

b) ifa€Q then «+1EQ; 

c) iffeQthenz'e€Q. 

One way to define what we mean by smallest here is to define Q to be 
the intersection of all subsets of R which have these three properties. 
The first step in the proof is to show that Q is the set of all positive 
rational numbers. To demonstrate this, we first prove that Q contains all 
positive integers. This follows easily from the fact that Q is closed under 
translations by 1. Because 2 € Q, it follows from the closure of Q under 
reciprocals that 1/2 € Q. Any positive rational of the form (2m + 1)/2 
can be written as m+ 1/2, which must be in Q because of (b) above and 


25. 
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the fact that 1/2 € Q. Having established that any rational of the form 
n/j € Q for all 1 < 7 < k we proceed to show that a rational of the 
form n/(k + 1) € Q by writing n/(k + 1) = m+ [(k +1)/p]7', where p 
is a positive integer less than or equal to k. By induction, we see that Q 
contains all the positive rationals, and therefore must be this set. 

The next step is to show that f(1) = 1. To prove this, we first show 
by induction that f(n +1) = c/(1+ nc), where c is defined to be f(1). 
Therefore f[c/(1 + nc)] = f?(n +1) =n+1. So 


eae bea rca) 


1l+ne 1+nc 


Cc 1 
=1(75.) 1+ f (aS) 
— 


Taking limits on both sides of this equation as n — oo, and using the 
continuity of f gives us f(1) = 1. 

The next stage of the proof is to show that f(x) = 1/ax on the set Q. 
Clearly, f(z) =1/x when x = 1. If f(x) = 1/z, then 


f(x) ot 1 


~ eee lea! ea 


f(a@+ 1) 
and = 
fe ")=fPa@)sr= (a). 
So fe) =a * on Q: 
Using the continuity of f we extend the result from Q to all of (0,00). 


If we let y = a, we get f(2x) + f[2 f(x)| = f(2 fla + f(x)]). Next, let 
us replace x with f(a) in this equation, and subtract the equation above 
from the resulting equation. We get 


f2 FP? (x)] — f22) = f(2 FF(x) + F?(@))) — FA fle + F(x). 
Suppose f?(x) > x. Then the left-hand side is less than 0. So 
f[f(2) + f?(@)] > fle + f(x). 


This implies that f(x) + f?(x) < 2+ f(x), which is a contradiction. The 
case f?(x) < x is disposed of similarly. 


7.5 Hints for Chapter 4 


1. 


Let 
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p(x) = |f(x)| — |g(z)| + A(z). 
The critical values of the function p(a) are at « = —1 and x = 0. Because 
the function is linear between these critical values we can write 


p(x) =ala+1/+b|a|+cat+d. 
Solve for the constants a, b, c, and d. 


2. a) First note that if x is a root of f(a) then so is 7+. Similarly, if x+1 
is a root of f(a) then so is x? + x. The latter statement is equivalent 
to saying that if x is a root of f(a) then 


(29-1)? +(2-1)=2?-2@ 


is also a root. However, if z 4 0 is any root of f(a) in the complex 
plane, then 
max(|x* + 2], |x? — a|) > |z|. 


Thus, if there were a nonzero root, we would be able to generate an 
infinite sequence of distinct roots. This could only happen if f(a) were 
a constant polynomial with f(«) = 0 or f(x) =1. 
The only other possibility is that f(a) is not constant, and 0 is the 
only root. In this case, f(x) = x* for some k. 

b) Examining the degree of each polynomial, we observe that 


deg f - deg g = deg f + deg g. 


This is satisfied in nonnegative integers if both degrees are zero or 
both degrees are two. In the first case, we must have 


f@)=c, g(a) =1, 


where the constant c is arbitrary. 
When the degrees of both polynomials are two, we must have some 
point x in the complex plane such that g(a) = 0. Then 


This implies that f(x) can be written in the form f(x) = cx (a +a) 
for some constant a. It is easily checked that 


f(x) = ¢(x? +az) 


g(z) = 2? +ar-a4 


is the general solution where a and c are arbitrary. 
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3. Set « = 0 and y = 0. This is a case in which ry +yz+22 = 0. So the 
equation reduces to 


FO) + f(—2) + f(z) =2 Ff). 


In this equation, the special case z = 0 yields the formula 3 f(0) = 2 f(0), 
which implies that f(0) = 0. Therefore 


f(z) + f(-2z) = 2 f(z) 


which is equivalent to f(z) = f(—z). This holds for all z. The polynomial 
f is even. Therefore, we can write f(x) = g(x”), for some polyomial g 
with real coefficients. So 


g((# — y)*) + g((y — z)*) + g((z — #)*) = 2g((a@+ y+ z)’) 


for all x, y, and z such that ry+yz+z2=0. 
For any such triple x, y, z, let r=x—y,s=y—zandt=z-—z. Then 


(ct+ytzPoe?+y+2 
(2? + y? +27) — (ay t+yz+22) 


= (+s? +7) 
= [r? +s? + (—r s)”] 


=r?+st49rs. 


So our functional equation in g reduces to 
g(r?) + g(s*) + g((r +. s)?) = 2g(r? +rs4 87). 
The special case in this equation where r = s yields 
2g(r*) + g(4r*) = 29(3r*). 


The next step is to show that this equation must be satisfied for all real 
r. So far we have only verified this equation for values of r such that 
r=x-—y,r=y- 42, and such that ry+yz+za = 0. However, it is 
possible to show that any real number r has such a representation. The 
values 
c=rt+ Bl y= ual Z= Bl. = 

V3" V3" v3 
work. So, letting u = r?, we deduce that 

2g(u) + g(4u) = 29(3u) 


for all u > 0. Let g(u) = a9 +a, ut+---+a,u”. A simple argument based 
on matching the coefficients on each side of the equation now shows that 
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a; = 0 for all 7 > 3. In addition, 9(0) = f(0) = 0. So ap = 0. Therefore, 
we can write 
g(u) = a, utagu’. 


It follows that we can write 


f(r) = g(r?) = ar? +agr*. 
It can be checked that every function of this form satisfies the original 
functional equation. 


4. Consider f(1). We are given that f(f( 
increasing in n. If f(1) = 1, then f(f( 
So f(1) > 2. If f(1) > 2, then f(f(n)) 
because f(f(1)) =3-1=3. So f(1) = 2. 

Next, we see that f(2) = f(f(1)) = 3. Continuing we get 


f(3) = F(F(2)) =3-2=6. 


Using reasoning of this kind, it is left to the reader to verify by induction 
that for any k = 0, 1, 2, ..., we must have 


1)) = 3, and that f(n) is strictly 
1)) = 1, which is a contradiction. 
> 3 which is also a contradiction, 


f (3*) =2-3* and 
Pas) Sarr. 


What about the numbers between these values? Suppose 3° <n < 2-3". 
Because f(n) is strictly increasing, 


Be (Hy ears, 


There are 3° — 1 numbers strictly between 2-3” and 3*+1. These are the 
possible values of f(n). There are also 3° — 1 numbers n strictly between 
3” and 2-3*. Because f(n) is strictly increasing, it follows that 


f(3* +9) = f(3") +5 


for 7 = 0,1,2,...,3*. 
So 
f(2-3" + 5) = F(F(3* +3) 
=3-(3° +3) 
= 3143; 


forall) 0. De 2.8%; 
We leave it to the reader to evaluate f(2001). 


5. 


a) 
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Set c = 0. We get 
f(f(y)) = (Ff)? +9. 


So it is natural to define f(0) = a so that f(f(y)) = a? + y. The 
function h(y) = a? + y on the right-hand side is onto. That is, as y 
ranges over all reals, so does h(y). Therefore f(y) must be onto as 
well. 

Because f is onto, there must exist a number b such that f(b) = 0. 
Setting x = b in the original functional equation yields 


f(f(y)) =y- 


Thus f is an involution. So 


= flf(f(x)) f(z) + f(y)], replacing x with f(f(2)) 
= flf(w) f(F(@)) + f(y], commuting factors 

= (f(f()))? ty 

=a7+ y 


So (f(x))? = x? for all x. 

Looking at the original functional equation, we see that the solution 
f(x) = x works. Let us look for other solutions. If we have some other 
solution, then there must exist some value r # 0 such that f(r) = —r. 
So for any y, 


rary Toy ey 
= f(r f(r) + f(y). 


Let us square this last expression to see what we get. 


[f(r f(r) + Fy)? = Ir f(r) + FW)? 
= [-r? + f(y)]’. 


Putting these two chains of identities together, we have 


[r2+y]° = [-r? + F(y)] 


But this can only hold if f(y) = —y. It can be checked that this is a 
solution to the original functional equation. 

We conclude that the only two solutions are f(x) = x for all x and 
f(a) = —a for all x. 

We begin by looking for simple polynomial solutions to the equation. 
If f(x) is a polynomial, then each side of the equation is a polynomial 
in both x and y. Matching the degree in the variable x in both sides, 
we conclude that [deg f]? = 2 [deg f]. This is solved by deg f = 0 or 
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deg f = 2. Upon further inspection, we observe that f(a) = 0 works 
in the former case, and f(x) = x? works in the latter case. 
Are there any other solutions? Suppose there exists some real number 
c #0 such that f(c) 4 c?. We show that f(x) = 0 for all x. Let us 
make the substitution 

a? — f(z) 


y= 5) 
in our original functional equation. Then 


p(FRLO) | 5 (APO) 2 see? 100). 


Therefore 

f(x) (2* — f(x)) =0. 
So for every x, either f(z) = 0 or f(x) = x?. In particular, f(0) = 0. 
Additionally, f(x) = f(—«) for all real a. 
So if there are any solutions other than the two polynomial solutions, 
they can only be of the form where f(x) = 0 for some values of x 4 0, 
and f(a) = x? for the other remaining nonzero values. Set x = 0 in 
the original functional equation. Then 


f(-y) = fy) 


for all y. Suppose that there is some value c 4 0 such that f(c) = 0. 
Now set x = c in the original functional equation. Then 


fy=f(e-y). 


Putting these last equations together, we have 


fy) =f(-w =f(e +y). 


Because c # 0, we can conclude that f is periodic, with period at most 
2 

C 

Next, we substitute y = c? into the original equation, and use the 


periodicity that we have just found to conclude that 


f(f(@)) = fF@) +e) 

= f(x?-e) +4 f(a) e 

= f(a?) +42 f(z). 
However, if we substitute y = 0 in the original functional equation, 
we get 

f(f(@)) = f(a’). 
If we put these last two equations together, we see that c? f(x) = 0 
for all x. But by assumption c 4 0. So f(a) = 0 for all x. 


We conclude that there are only two solutions: f(a) = 0 for all x, and 
f(x) =<? for all az. 
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6. Consider the case where f(x) = x for all x. This is a strictly monotonic 
function. Then x = x g(y) for all x. It follows that one solution is g(y) = 1 
for all y. 

Are there any other solutions? Suppose f is any strictly monotonic func- 
tion and g a corresponding solution. Set y = 0. Then 


f(x) = F(a) g(0) + F(0) 


for all z. In order for this to hold, we must have g(0) = 1 and f(0) = 0. 
Because f is strictly monotonic, we can conclude that f(a) 4 0 for any 
value of x # 0. 

Note that the left-hand side of the functional equation is symmetric in x 
and y. We can exploit that symmetry to conclude that 


f(x) 9(y) + f(y) = fl@t+y) 
= f(y) g(x) + f(a) 


for all x and y. Equivalently, we may write 
gly)-1 _ g(#)-1 
f(y) f(x) 
for all x 4 0 and all y 4 0. This can only hold if the left-hand side and 


the right-hand side are in fact constants. So there is some constant k such 
that 


gx)-1_, 
f(x) 

for all « # 0. Therefore, any solution for g(a) can be written in terms 
of f(x) as g(x) = k f(x) + 1 for x # 0. Because we already know that 
f(0) = 0 and g(0) = 1, this equation must also work for x = 0. 
Let us consider the possible choices for g(x) that this equation imposes. If 
k =0, then g(x) = 1 for all x. But this is a solution that we have already 
found. 
Suppose k 4 0. Then 


giaty)=kf(rty)+1 

=k f(x)gy)+kfy)+1 
(g(x) — 1) g(y) + g(y) 
g(x) g(y) - 


So g(x) is a strictly monotonic function satisfying Cauchy’s exponential 
equation. It follows that g(a) = a” for some a > 0 and that f(x) = 
k~'(a” —1). This works. Thus we have all the solutions. 
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7. Consider the function g(a) = f(«+1)—1. The function g is nondecreasing 
and satisfies 


g(0)=0, — g(-1) = —1. 


Let r <0 and s > 0. Then we can write r= 2—lands=y-—1forx<1l 
and y > 1. Therefore 


(—(@-1) (y—1)) 
(l1—(@-1)y-)-1 


(x) + f(y) — F(a) Fly) - 1 
= —(f(x) — Df) - 1) 
= —g(r) 9(s). 


This is equivalent to 

g(r 8) = —g(—r) g(s) 
for all r, s > 0. The condition that the nondecreasing function g satisfies 
this multiplicative property with g(0) = 0 and g(—1) = —1 is equivalent 
to the original problem. So it suffices to find all such g. 
We first consider the case where g(1) = 0. Then for any s > 0 we have 


g(s) = g(s1) 


So any nondecreasing function g such that g(—1) = —1 and g(s) = 0 for 
5 > 0 will satisfy the functional equation for g. 
Next, we consider the case where g(1) 4 0. Note that for all x > 0, 


g(x) = g(x 1) 


Moreover, h(0) = 0, and h(1) = 1. Because h is a monotone function 
satisfying these properties, we conclude that there exists some k > 0 
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such that h(x) = x” for all x > 0. Thus g(x) = cx’, where c = g(1). 
Furthermore for x > 0, 


Thus our full family of solutions for f is in the first case, 


1 forx>1, 
f(x) = 0 for «= 0, 
any fora <0. 


(Here f is constrained additionally to be nondecreasing.) The full family 
of functions in the second case is 


e(z—1)* +1 for «> 1, 
f(z)= 1 for xc =1, 
—(1—2)F+1 for x <1. 


Here, c>0 and k>0. 


. Yes, the function exists. In fact the function is unique up to a sign change. 
Let x be a positive rational number. Write « = a/b, where a and b are pos- 
itive integers that are relatively prime. We apply the Euclidean algorithm 
to the pair (a, 6) to construct a sequence of remainders 1g,11,...,1n as 
follows. Set ro = a, r; = b. Then 


TE-1 = YZ TT G41 


for 7 = 1,...,n, where finally r,4; = 0. Because the value of n is a 
function of x, we can write n(a) for this. For each x € Q, define 


(=1)°) for x >0, 
s(x) = 1 for x =0, 
(Set for az <0. 


For example, if « = 5/4, then 
ro = 5, ry = 4, rg = 1, rz; =0 


so that n(5/4) = 2. Thus s(5/4) = (-1)? = 1. 

Then s(x) satisfies the required conditions. For suppose x + y = 0 where 
x and y are distinct rational numbers. Then one is positive and the other 
negative, say x > 0 and y < 0. Then 
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Thus s(x) s(y) = —1l. 

Next, suppose that x + y = 1. Because x and y are distinct, neither is 
equal to 1/2. Moreover, one of them is positive, say x. Write « = a/b as 
above. Then y = (b — a)/b. There are two cases to consider: y < 0 and 
y> 0. 

We consider the case y < 0 first. In this case, 


s(y) = (non 
= (=e8e ot ; 


It is left to the reader to show that 


n(a/b) = n((a — b)/b) 


using the fact that a > b for this case. (The two sequences generated 
by the Euclidean algorithm are the same once they get to a mod b and 
a—b mod 8, respectively. So s(x) s(y) = —1.) 

The other case to consider is y > 0. Then 


1 
oA SU 


It is left to the reader to show that n(y) = n(x)+1. Once again s(x) s(y) = 
—1. 

Finally, suppose that xy = 1. Then x and y are both positive or both 
negative. Thus it is sufficient to check that n(y) = n(x) +1, which is left 
to the reader. 
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